Life stresses


It is time for sociologists and biologists to bury the hatchet and cooperate to study the effects of environmental stress on how people behave.


In the modern world, ‘stress’ is too often used as a catch-all word,
a vague concept that bundles together a hectic pace of life and
the increasing pressures that come with it. To scientists, the fuzzy
notion of stress can symbolize the even fuzzier notion of the impact
of an environment on an individual: one side of the classic
nature-nurture debate.

Francis Galton, cousin of Charles Darwin and promulgator of his 
evolutionary theory, is credited with first defining the terms in this 
enduring conflict. “Nature is all that a man brings with himself into the
world; nurture is every influence which affects him after his birth,” he
wrote in his 1874 English Men of Science: Their Nature and Nurture. 

That debate got fiery a century later, when the twentieth-century 
revolution in biology — which enabled biologists to study development 
and even behaviour at a molecular level — met the 1968-era social 
revolution. It pitted those who believed that we are determined only 
by our genes against those who believed we are determined only by our 
environment, and ignored those who pointed to the dearth of evidence 
either way. 

Sociologists built a nurture fortress, from which they criticized what 
they saw as the scientific mainstream’s biological determinism. They 
argued that the unproven and unlikely theory was dangerous because 
it could lead to social policies based on incorrect assumptions, such as 
criminals not being reformable and young minds not being vulnerable 
to stress. In energizing the debate, they did a service to both academia 
and society by keeping scientific arrogance and influence in check. 

Scientists have long since abandoned any concept of biological 
determinism. It has now been proved beyond doubt that although 
our genes are fixed, their expression is highly dependent on what our 
environment throws at us. The current challenge is to work out precisely
how environment affects our biological tissues and changes us.

Until now, analysis of the environment has been the domain of social 
scientists, who have elaborated the correlations between environment 
and health or behaviour — for example, that a deprived upbringing 
increases the risk of deviant behaviours in adulthood. Now, biologists are 
starting to render visible how one aspect of the environment — stress — 
leaves marks on the body (see page 161). It shortens telomeres and alters 
epigenetics and brain processing — and it is most potent when it occurs 
during brain development, a surprisingly long period of time stretching 
from the third trimester of pregnancy to the end of adolescence. These 
stress-induced changes increase vulnerability to all sorts of conditions, 
including psychiatric disorders and antisocial behaviour. 

This rendering visible is starting to catch the attention of those who 
can influence social policy. For example, the influential American Academy
of Pediatrics this year published a report on what it calls “toxic
stress”, which says that poor, or poorly coping, mothers should be cared
for better while pregnant (A. S. Garner et al. Pediatrics 129, e224; 2012).
Last year a report commissioned by the British government covered 
similar ground, a development highlighted by Ilina Singh from King’s
College London (I. Singh BioSocieties 7, 308-321; 2012). Yet she is one
of only a handful of sociologists to notice that a new area of potential 
collaboration is opening up without their input. Science is indicating 
that stress damage can occur in the womb, and it is understandable that 
politicians will look for guidance on what to do about it. But right from 
the start, that guidance must involve sociologists, who are best placed, 
for example, to balance the potential benefits of increased monitoring 
against possible infringement of basic freedoms and rights. 

Many sociologists, however, are still immured in their fortress, struggling to catch
up with a debate that has shifted from nature-or-nurture to nature-and-nurture, or are
unable to shake off their distrust of scientists, worrying that scientists will force them to play
second fiddle in their own territory: the environment. That is a shame: both academia and
society still need their engagement at many
levels. Now is a perfect time for a reconciliation of the two cultures.

Funding agencies can play an important part in this. They should be 
aware of the need for an interdisciplinary approach and encourage their 
review panels to assess projects with an open mind. Psychologists, too, 
could offer crucial insight into these efforts, and contribute their experience
of how best to exploit the lessons of neuroscience.

Sociologists have been studying human environments for decades, 
and have tallied the social damage that stresses such as poverty or child 
abuse can cause. Biologists are now in a position to benefit from their 
insights, although they will need to learn the language of sociology. And 
sociologists stand to benefit from the understanding that biology will 
bring to their own, vindicated, empirical research.


Share alike


To make progress in clinical genomics, institutions must work out how to pass on data. 


10 October 2012 


Many newborn babies admitted to intensive care have genetic disorders. The symptoms can be obvious 
— uncontrollable seizures, spontaneously peeling skin, abnormal heartbeats — but the cause often 
remains unknown. That uncertainty has painful ripples: physicians have little knowledge about how to
guide treatment and parents are left unsure whether to have further children. 


Genome sequencing can help. Using the fastest available sequencing instrument and software designed 
to guide clinicians through analysis, a team at the Children's Mercy Hospital in Kansas City, Missouri, 
reported in Science Translational Medicine last week that they had used sequencing in newborns to sift 
for rare genetic mutations that might cause disease (C. J. Saunders et al. Sci. Transl. Med. 4, 154ra135; 
2012). The results were impressive. For three of the four infants, probable culprits were identified. 


To reach these conclusions, the team considered not just genetic regions in the babies, but also those in 
their relatives and in the scientific literature. However, for sequencing to reach its medical potential, 
researchers must be able to access even more genomes. Each person has millions of genetic variants — 
or sequences that differ from the human reference genome — making it hard to find those that might 
affect health. The key is to locate variants that recur in people with similar illnesses. 


With so many sequencing projects under way, clinicians are always eager to know whether a variant has
been observed in patients at other institutions. Analysis tools are available to help (see page 157). Yet 
there is currently no quick, reliable or convenient way to spread this information. 


Data sharing through scientific publication has fuelled an impressive collection of databases that
reveal frequencies of common variants. When variants or genes have been associated with disease, those
results are also deposited in databases. ClinVar, a database from the US National Institutes of Health
(NIH), for instance, gathers health-related genetic variations from the literature. And the NIH has
set money aside to create a separate resource for clinically relevant genetic variants: essentially a
curated database of variants for which some sort of clinical action is advised.


These are valuable efforts, but are inherently limited. Publication is too slow, and data collected about 
many variants will never be published. Researchers need to be able to query not just variants in the 
literature, but also those that have been found in other patients but not reported. 


This January, an advisory group to the UK Department of Health said that the country should create a 
centralized facility to store genomic data to improve treatments and diagnoses. However, in the United 
States, where many sequencing projects are based, regulations about sharing patient data will make 
setting up a centralized repository more difficult. 


One option would be to give patients their own sequenced genome data, letting them deposit it where 
they choose. Already, 23andMe, a consumer-genetics company in Mountain View, California, has used 
data and DNA supplied by its customers to discover (and, controversially, to patent) disease-associated 
variants. 


Another option is for medical-research institutions to agree on ways to share information with each other. 
Rather than transferring a full medical record, for example, a researcher at one institute could learn 
whether a variant had been observed in other people and, if so, what diseases they had. For this to 
work, new technology and shared platforms would have to be developed. 
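
As a rough illustration of the kind of exchange described above, the following Python sketch lets one institute ask others whether a variant has been seen, and which diagnoses accompanied it, without any full medical record changing hands. Everything in it is hypothetical: the `Institution` class, the `query_network` helper, the variant identifier and the diagnoses are invented for the example and do not describe any existing platform.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Institution:
    """Hypothetical site that keeps its patient records private."""
    name: str
    # Maps a variant identifier to the diagnoses of patients carrying it.
    _records: dict = field(default_factory=dict)

    def add_observation(self, variant: str, diagnosis: str) -> None:
        self._records.setdefault(variant, []).append(diagnosis)

    def query(self, variant: str) -> Counter:
        """Return only diagnosis counts for the variant, never patient records."""
        return Counter(self._records.get(variant, []))


def query_network(sites, variant):
    """Sum the diagnosis counts reported by every participating site."""
    total = Counter()
    for site in sites:
        total += site.query(variant)
    return total


# Invented example data: the variant name and diagnoses are placeholders.
site_a = Institution("Hospital A")
site_a.add_observation("chr1:12345A>G", "seizures")
site_b = Institution("Hospital B")
site_b.add_observation("chr1:12345A>G", "seizures")
site_b.add_observation("chr1:12345A>G", "arrhythmia")

summary = query_network([site_a, site_b], "chr1:12345A>G")
print(dict(summary))
```

The design choice worth noting is that each site answers with aggregate counts only, so the question "has anyone else seen this variant?" can be answered while the underlying records stay where they are. A real system would also need authentication, consent handling and agreed variant naming, none of which is sketched here.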


There are other problems. Sequencing data are imperfect. High-throughput sequencing technology 
sometimes overlooks variants or makes other errors. Downstream issues are rife: are the benefits of 
sequencing worth the costs? Can the information be protected? How accurate are conclusions? Should 
information not related to the immediate medical question be shared with patients, even if a diagnosis is
uncertain or no treatment is available? 


All of these questions would be easier to answer if genomics data could provide more certainty. Yet to 
achieve this, researchers must look at the genomes and health information of more people. At this stage, 
the best path forward remains unclear. But for genomics to advance, the community must communicate. 
Institutions must consider not just what is best for their particular situation, but also what is best for the
broader community. A good place to start is for staff at genomics centres and hospitals to meet in 
person, to share experiences and best practices. 
Fighting chance


Collaboration between geneticists and economists has the potential to bear fruit. 


10 October 2012 


One side is accused of supporting ethnic cleansing; the other of being intellectually naive. Does that 
sound like the beginning of a fruitful collaboration? Perhaps not, but read on. 


As we report on page 154, an increasingly bitter spat has emerged between geneticists and economists 
over a paper that links a country's genetic diversity to its economic development. 


At its heart, the argument boils down to cold statistics and methodological differences. A team of 
prominent geneticists and anthropologists at Harvard University in Cambridge, Massachusetts, says that 
the paper's economist authors did not properly account for historical and cultural connections between 
genetically similar countries, so correlations are mistaken for cause. 


The work is part of an emerging trend to blend economics with genetics. Daniel Benjamin, an economist 
at Cornell University in Ithaca, New York, who is trying to identify the genetic basis for economically 
relevant traits such as risk aversion, is among those who say that the combination has yet to prove its
worth. Nonetheless, he and others assert that understanding how genetics influences individual and 
international economies has the potential to inform policy. 


For this to happen, both sides must take seriously the standards, methodology and history of the other. 
Geneticists have spent years grappling with the difficulties of getting useful information out of genomes.
They have made mistakes, and learned from them, and it is naive for social scientists to think that they 
are immune from these errors, or that they can learn all they need to quickly. Benjamin says that nearly 
every study that links individual economic traits with specific genetic variants, for example, is riddled with 
false positives. 


Social scientists should also remember that human geneticists bear the historical scars of eugenics, and 
more recent accusations of insensitivity to indigenous populations. Any whiff of biological determinism will 
draw a strong response. 


Geneticists, for their part, should acknowledge that quantitative social scientists are experts in measuring
human behaviour, both individual and collective. An entire subfield of economics, called econometrics, 
exists to make sense of data that are just as seemingly random as the string of As, Ts, Cs and Gs that 
comprises a genome. Moreover, many of the statistical methods that economists now use have their 
roots in the work of early-twentieth-century geneticists. Closer collaboration between the two fields could
unlock the knowledge and expertise of social scientists, enabling them to draw conclusions that
geneticists would never have conceived. 


One hopeful model is the Social Science Genetics Association Consortium, a collaboration between 
social scientists, geneticists and epidemiologists that aims to bring more rigour to the search for the 
genetic basis of economic and other behavioural traits. In addition to combining the expertise of scientists 
in disparate fields, the consortium also has access to dozens of cohorts, encompassing more than 
100,000 people. 


One of its first positive results — a genetic variant linked to educational attainment in some 2,000 
Icelanders — could not be replicated in other populations, raising questions over whether it is real. But 
the group's expertise and infrastructure give it a chance of finding genuine links that will hopefully see
geneticists working on follow-up studies, rather than writing angry letters. 


Nature 490, 144 (11 October 2012)  doi:10.1038/490144a 



Nature ISSN 0028-0836 EISSN 1476-4687 


© 2012 Nature Publishing Group, a division of Macmillan Publishers Limited. All Rights Reserved. 


WORLD VIEW


Perhaps it’s just me, but I find something satisfying in watching
a group of experts pore over a problem and then try to come to
a collective conclusion. I started my science-journalism career 
covering the meetings and reports of the Intergovernmental Panel on 
Climate Change (IPCC) in the mid-1990s. I’ve tried it from the other 
side too — helping to organize assessments of biotechnology policy in 
developing countries and of the state of innovation in the Islamic world. 

But expert assessments are dogged by problems, and I had begun to 
think that there was a growing reluctance to attempt them. That was 
until the University of Oxford, UK, announced in early September that 
they are taking another throw of the dice. 

Ian Goldin, an expert in globalization and director of the university’s
Oxford Martin School, and Pascal Lamy, director-general
of the World Trade Organization (WTO) in 
Geneva, Switzerland, have put together a panel 
of 20 experts and handed them what may be the 
mother of all remits. 

They have asked them to look into why issues 
such as tackling climate change or reducing 
trade barriers remain unresolved, and to make 
their recommendations to policy-makers by the 
middle of next year. The grandly named Oxford 
Martin Commission on Future Generations 
promises to differ from previous expert assessments
because it will also interrogate how global
issues are connected to each other. 

The panel includes Brazil’s former foreign- 
affairs minister, Luiz Lampreia; the chair of South 
Africa's planning commission, Trevor Manuel; UK 
Astronomer Royal, Martin Rees; and the economist, Nicholas Stern. 
Seeing a list of such eminent names, it is tempting to assume that the real 
work will be done by a talented but overworked secretariat. But Goldin
promises that his panel members are not there to decorate the cover of 
the final report: they are there to generate ideas. In other words, to work. 

Goldin and Lamy have put a lot of thought into compiling their list. 
Alongside Lamy, for example, there is former World Bank president 
Robert Zoellick, who was also the US representative on the WTO at 
the same time that Lamy represented Europe as commissioner for 
trade. The pair, free from the burdens of representing Europe and the 
United States in trade talks, should be able to draw on their experiences
to guide their successors.

There is also Julia Marton-Lefèvre, director-general of the International
Union for Conservation of Nature in Gland, Switzerland,
who will no doubt inject some environmental caution should the free-traders
get too excited.


So will the Oxford Martin Commission succeed where others have failed?
To do so, it will need to learn lessons from previous assessments.




Expert assessments can work if lessons are learned


For the Oxford Martin Commission on Future Generations to be heard, 
it should learn from previous mistakes, argues Ehsan Masood. 


The first of these is about the nature of its endeavour. ‘Big tent’ 
assessments seek consensus among natural foes, but in doing so, they 
carry risks. 

Goldin will know this well, having been at the World Bank during 
much of the International Assessment of Agricultural Knowledge, 
Science and Technology for Development, which started in 2002 and 
lasted for six years. An IPCC-style assessment for food with some 
4,000 participants, it attempted to bridge the interests of industry 
with those of campaign groups, particularly over the role for genetic 
modification in agriculture. The biotechnology firms Monsanto and 
Syngenta walked out shortly before the launch of the final report. 

The Oxford Martin group of 20 is a more manageable number of 
people, many of whom (although coming from different perspectives) 
are friends or know each other professionally. 
One might hope, then, that they will be able to 
resolve disagreements. 

The second lesson is the importance of communicating
using plain language. The final
report of the mighty Millennium Ecosystem 
Assessment in 2005, for instance, was almost 
impenetrable to those of us in the media, and 
also to policy-makers. Unusually, Goldin’s expert 
panel features two journalists, Arianna Huffington
of The Huffington Post and Lionel Barber of The 
Financial Times. 

But I still see difficulties ahead: in particular, 
the tension between accessibility and providing 
usable knowledge. Goldin and Lamy have chosen
hard problems, the solutions of which are
best understood and applied by experts. Can we expect a final report
rooted in interdisciplinarity, and aimed at general readers, to be taken 
seriously by these specialists? 

Another problem is that of authority. In the heyday of global assessments,
a group of experts could tell the rest of us what to think and how
to behave and we would at least take note. That is an unfashionable 
approach today, yet nobody seems to have told the Oxford Commission. 

It isn't too late. The Commission could webcast its meetings or create
a rolling blog or wiki into which people can feed ideas and comment.
The end result would be worth the effort, while reassuring many
that this is not just an elite-level exercise. 

The Oxford Commissioners must also avoid the trap of focusing 
exclusively on the final report. Many of the policy processes they want 
to fix have broken down because of a lack of trust. One thing that I’ve
learned is that to involve your audience in the process of gathering 
knowledge is often as important as the eventual conclusion.


Ehsan Masood is editor of Research Fortnight and co-author with 
Daniel Schaffer of Dry: Life Without Water. He is based in London. 
e-mail: eham@researchresearch.com 
REPRODUCTIVE BIOLOGY


Mouse eggs from 
stem cells 


Researchers in Japan have 
turned mouse stem cells into 
egg cells that, in turn, gave rise 
to healthy offspring. 

Katsuhiko Hayashi, Mitinori 
Saitou and their colleagues 
at Kyoto University added 
signalling molecules to female 
mouse embryonic stem cells, 
and to female ‘reprogrammed’ 
or induced pluripotent stem 
cells. This process transformed 
the stem cells into primordial 
germ cells — precursors to 
eggs. Culturing the germ cells 
with embryonic ovary tissue 
gave ovary-like structures, and 
after four weeks, the precursors 
developed into egg cells. The 
team fertilized the eggs in vitro 
and transplanted the embryos 
into foster mothers; the 
resulting offspring grew up to 
be fertile themselves. 

The work could ultimately 
yield insights into potential 
treatments for infertility, the 
authors say. 

Science http://dx.doi.org/10.1126/science.1226889 (2012)

For a longer story on this research,
see http://go.nature.com/xuyljd


Selections from the scientific literature


BIOMATERIALS


Different webs snag different prey


A common house spider uses two different types of silk structure, or attachment disc, to
anchor its cobwebs: one for webs that ensnare walking insects and another for those intended
to trap flying prey.

‘Gumfoot’ silk threads are connected to the ground and can easily release and yank walking
prey into silk nets; ‘scaffolding’ silk is more securely attached to substrates and is used to
catch flying insects. Ali Dhinojwala and his team at the University of Akron, Ohio, used electron
microscopy to examine cobwebs made by the spider Achaearanea tepidariorum, and found that
each of these webs is anchored using a different type of attachment disc. Gumfoot discs (pictured
right) consist of thin arms, whereas scaffolding discs (left) have thicker fibres with many more
attachment points, and stick to surfaces 10 times more strongly than gumfoot discs.

Synthetic discs made from nylon strands and tape strips displayed similar properties to their
natural counterparts.

Nature Commun. http://dx.doi.org/10.1038/ncomms2099 (2012)


Contemplating a coral comeback


The Great Barrier Reef (pictured) has lost about half its coral over the past 27 years,
according to data from more than 2,000 surveys of 214 reefs. But addressing local threats
could tip the balance towards an increase in coral cover in the short term, even in the face
of continuing climate change, researchers say.

Glenn De’ath of the Australian Institute of Marine Science in Townsville and his colleagues
analysed the survey data and found that the mean percentage of coral cover declined from 28% to
13.8% between 1985 and 2012. Typhoons and outbreaks of the native, coral-eating crown-of-thorns
starfish (Acanthaster planci) accounted for about 48% and 42% of the decline respectively.
Bleaching related to climate change was responsible for 10%.

If severe starfish outbreaks could be prevented, perhaps by reducing nutrient run-off to the
area, coral cover could recover at a rate of up to 1% per year, the researchers say. However,
stabilizing global temperatures will be key to the long-term success of this strategy.

Proc. Natl Acad. Sci. USA http://dx.doi.org/10.1073/pnas.1208909109 (2012)
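
To make the arithmetic behind the coral-cover figures quoted above concrete, here is a small Python sketch. It uses only the numbers given in the summary (28% and 13.8% cover; 48%, 42% and 10% of the decline). The variable names are illustrative, and the reading of "1% per year" as a flat one percentage point of cover per year is an assumption made for the example, not something the summary states.

```python
# Figures quoted in the coral-cover summary.
cover_1985 = 28.0   # mean coral cover in 1985, percent
cover_2012 = 13.8   # mean coral cover in 2012, percent
shares = {"typhoons": 0.48, "starfish": 0.42, "bleaching": 0.10}

decline = cover_1985 - cover_2012       # loss in percentage points
relative_loss = decline / cover_1985    # fraction of the 1985 cover lost

# Split the decline among the three causes in the reported proportions.
points_lost = {cause: share * decline for cause, share in shares.items()}

# Assumption: recovery adds a flat 1 percentage point of cover per year.
years_to_recover = decline / 1.0

print(f"decline: {decline:.1f} points ({relative_loss:.0%} of 1985 cover)")
for cause, points in points_lost.items():
    print(f"{cause}: {points:.1f} points")
print(f"years to regain 1985 cover at 1 point/year: {years_to_recover:.1f}")
```

Under these assumptions the relative loss comes out at roughly one half, which matches the "about half its coral" in the opening sentence of the item.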


A genome-wide 
‘on’ switch 


A common cancer gene 
works not by implementing 
unique gene-expression 
patterns as previously 
thought, but rather by 
boosting the expression 
of almost all active genes, 
according to work from two 
independent groups. 
Increased expression 
of the gene MYC is often
seen in human cancer,
and generally bodes ill for
prognosis. Richard Young at 
the Massachusetts Institute 
of Technology in Cambridge 
and his colleagues studied 
the encoded protein, c-Myc, 
in human tumour cells. The 
researchers found that c-Myc 
accumulates at the regulatory 
sequences of active genes and 
boosts their transcription into 
RNA, amplifying the gene-expression
program already
in place. 

Meanwhile, Keji Zhao and 
David Levens at the National 
Institutes of Health in 
Bethesda, Maryland, and their 
colleagues analysed normal 
mouse cells. They showed that 
c-Myc did not induce gene 
expression itself, but rather 
amplified the expression of 
nearly all active genes. 

This mechanism could help 
to explain the diverse effects of 
c-Myc in different cancers. 

Cell 151, 56-67; 68-79 (2012) 


How slime moulds 
keep track 


Sticky trails of sugar and 
protein secreted by slime 
moulds help the single-celled 
organisms to find their way, 
by marking where they have 
previously travelled. 
Christopher Reid at the 
University of Sydney in 
Australia and his team found 
that the slime mould Physarum 
polycephalum (pictured) — 
which senses and migrates 
towards chemical cues such 
as glucose — avoids its own 
secretions if possible, preferring 
to seek food in unexplored 
areas. When researchers used 
a U-shaped barrier to trap the 
creature en route to a glucose 
snack, 96% of organisms 
found their way out within 
120 hours, mostly by trying 
untested territory. However,
when the entire environment
was coated in slime, only 33% 
of slime moulds reached the 
goal in that time, and those that 
did travelled longer distances 
than organisms in a slime-free
setting. 

Slime moulds use their 
trails to navigate complex 
environments more efficiently, 
the team concludes. 

Proc. Natl Acad. Sci. USA http://dx.doi.org/10.1073/pnas.1215037109 (2012)


EARTH SCIENCE 


Planetary 
wanderlust 


Earth’s continents seem to be 
drifting westward by around 
0.2 degrees every million 
years. This is a sign that the 
planet is experiencing true 
polar wander — the rotation 
of Earth's solid outer layer 
relative to its spin axis. 

To distinguish true polar 
wander from the overlapping 
motion of individual tectonic 
plates, Pavel Doubrovine and 
his colleagues at the University 
of Oslo defined a new global 
reference frame, which they 
consider to be accurate for the 
past 120 million years. To do 
so, the researchers used the 
computed tracks and present 
positions of volcanic hot spots 
in the Pacific Ocean and the 
Indo-Atlantic hemisphere. 
The rate of true polar wander 
has increased systematically 
over the past 40 million years, 
the team found. 

J. Geophys. Res. http://dx.doi.org/10.1029/2011JB009072 (2012)


FISHERIES


Questioning tuna marine reserves


Closing off areas of the Pacific Ocean to bigeye-tuna fishing might not be the most effective
conservation measure, a modelling study suggests.

Pacific populations of bigeye tuna (Thunnus obesus) are being threatened by both
‘purse seine’ fishing, which involves catching large schools of fish with nets, and longline
fishing. This led authorities to close two areas to purse seine fishing in 2009, but the
effects of these closures have never been evaluated. John Sibert at the University of
Hawaii in Honolulu and his colleagues used a model of tuna population dynamics to
simulate the effects of different fishery-management practices on tuna biomass.

The researchers found that closing high-seas enclaves in the western central Pacific
Ocean to purse seine fishing had little effect on the fishes’ biomass. Reducing longline
fishing in spawning areas, and prohibiting the use of fish-aggregating devices that
increase the incidental catch of bigeye tuna, seem to be more efficient ways of maintaining
tuna populations.

Proc. Natl Acad. Sci. USA http://dx.doi.org/10.1073/pnas.1209468109 (2012)


The most viewed papers in science


CHOICE


In sport, winners do take all


The distribution of scores and prize money in sport is governed by the same power
laws that determine the distribution of city size or wealth: a few top-ranked players
and teams accumulate the vast majority of the prizes.

Wei Li of Central China Normal University in Wuhan and his colleagues came to this
conclusion after studying ranking data from 12 sports, including golf, tennis and snooker.
Detailed analysis of the tennis data revealed that when any two players compete, the
probability that the higher-ranked player will win is related to the difference in the
individuals’ rankings. Moreover, the data from all 12 sports seem to follow the Pareto
principle, or 80-20 rule: 20% of the players accumulate 80% of the points and money.

Highly read on www.njp.org in September

N. J. Phys. 14, 093038 (2012)
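
The 80-20 rule mentioned in the sport item above has a simple closed form if one assumes, purely for illustration, that winnings follow an idealized Pareto distribution with shape parameter alpha greater than 1: the top fraction p of players then holds a share p^(1 - 1/alpha) of the total. The Python sketch below evaluates that textbook formula. The function name `top_share` and the choice of alpha are assumptions for the example, not values or methods reported by Li and colleagues.

```python
import math


def top_share(p: float, alpha: float) -> float:
    """Share of the total held by the top fraction p under a Pareto law.

    Valid for an idealized Pareto distribution with shape alpha > 1.
    """
    if not 0.0 < p <= 1.0:
        raise ValueError("p must lie in (0, 1]")
    if alpha <= 1.0:
        raise ValueError("alpha must exceed 1 for a finite mean")
    return p ** (1.0 - 1.0 / alpha)


# The shape that reproduces the 80-20 rule exactly is log(5) / log(4).
alpha_80_20 = math.log(5) / math.log(4)

print(f"alpha = {alpha_80_20:.3f}")
print(f"top 20% hold {top_share(0.2, alpha_80_20):.0%} of the total")
```

Larger values of alpha give a more even spread, so the top 20% hold less than 80%; values closer to 1 concentrate the winnings further.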


ANIMAL BEHAVIOUR 


Echolocation for 
communication 


Animals such as bats use 
echolocation as a form of 
sonar to find food at night, 
but they might also use it to 
communicate. 

Mirjam Knörnschild at the University of Ulm in Germany and
her colleagues recorded and analysed the echolocation
calls of a social species of bat, Saccopteryx bilineata (pictured), at
three sites in Costa Rica. The researchers found that
roosting males seem to detect the echolocation calls
of an approaching bat from at least five metres away. In
response to an incoming male,
the bats emitted aggressive
vocalizations suggestive of
territorial defence. If the
approaching bat was female,
however, the males responded
with courtship songs.

The males must be using 
echolocation, the authors 
conclude, because in low-light 
conditions at a distance of 
at least five metres, neither 
visual nor odour cues could 
provide the roosting bats with 
information about the sex of 
their visitor. 

Proc. R. Soc. B http://dx.doi.org/10.1098/rspb.2012.1995 (2012)
SEVEN DAYS


GM study slammed
A study claiming that rats fed Monsanto’s genetically
modified NK603 maize (corn) or its companion glyphosate-based
herbicide, Roundup, showed increased incidences
of cancer (G.-E. Séralini et al. Food Chem. Toxicol.
http://doi.org/jgq; 2012) has
been roundly criticized by
the European Food Safety
Authority. On 4 October, the
agency called on the study's
authors to share more of
their data and said that it was
“presently unable to regard
the authors’ conclusions as scientifically sound”.

See page 158 for more. 


Nuclear safety 
Hundreds of safety upgrades 
are needed at European 
nuclear reactors, according to 
an analysis of the continent’s 
power plants. Following
the March 2011 Fukushima
nuclear crisis in Japan, the 
European Commission ran 
safety tests that included all 
145 reactors at nuclear power 
plants in the European Union. 
A report on these ‘stress tests, 
published on 4 October, 
found that “practically all” 
nuclear plants need safety 
improvements. Problems 
ranged from the lack of 
back-up control rooms to 
substandard risk assessments. 
See go.nature.com/cbbiah for 
more. 


Fracking furore 

The US Environmental 
Protection Agency (EPA) said 
last week that data provided 
by the US Geological Survey 
were consistent with its 
December 2011 finding that 
the use of hydraulic fracturing 
— or ‘fracking’ — to extract 
natural gas had contaminated 
groundwater near Pavillion, 
Wyoming. An independent
analysis commissioned by
environmental groups and
published on 3 October
supported the EPA's
assessment, but industry
officials continue to
question the source of the
contamination. See
go.nature.com/xzorhp for more.


Mammoth unearthed from Siberian mud

A remarkably well-preserved 30,000-year-old
mammoth was revealed on 4 October, after
an 11-year-old boy had spotted its limbs
jutting out from the frozen mud of the Taymyr
peninsula in northern Russia earlier this year.
Researchers excavated the 500-kilogram
carcass from the Siberian tundra in September,
and it was formally identified earlier this
month. Researchers at the Zoology Institute
in St Petersburg said that its DNA was badly
damaged, probably making it useless for
cloning (see Nature 456, 310-314; 2008).


UK funding boost
The UK government
will add £200 million
(US$321 million) to a
fund to promote research
partnerships between
universities and industry,
which the government says
has been oversubscribed since
it launched with £100 million
in June. The announcement
on 8 October was welcomed
by scientists, although the
Campaign for Science and
Engineering, a London-based
lobby group, noted that
science investment is yet to
recover from budgets slashed
in 2010.


Chemistry lawsuit
The American Chemical
Society, the world’s largest
scientific society, announced
on 5 October that it would pay
US$22.6 million to Leadscope,
a chemical-information
company in Columbus, Ohio,
to settle a ten-year lawsuit.
The Ohio Supreme Court
had found on 18 September
that the society’s 2002
lawsuit against Leadscope
was “objectively baseless”
(see Nature 489, 482-483;
2012) and had been filed
unfairly to crush a competitor
to its Chemical Abstracts
Service, which brings in some
$300 million a year to the
non-profit society.


Dystrophy drug hope 
The experimental drug
eteplirsen may help patients
with Duchenne muscular
dystrophy (DMD), a
clinical trial of 12 boys with
the condition reported
on 3 October. Scientists
at biotech firm Sarepta
Therapeutics in Cambridge,
Massachusetts, which makes
the drug, revealed that four
boys who had taken a high
dose of eteplirsen for nearly
a year were able to walk an
average of 21 metres farther
in six minutes than at the
start of the trial. (Those on
a placebo showed a decline
of 68 metres.) The company
plans to file for regulatory
approval with the US Food
and Drug Administration.
About 1 in 3,600 boys develops
DMD, which is caused by
mutations in a gene on the
X chromosome and eventually
leads to paralysis and death.


Private spaceflight 


California firm SpaceX
launched its first mission to
resupply the International
Space Station on 7 October,
a milestone in commercial
spaceflight. As Nature went
to press, the Dragon craft was
due to dock with the space
station on 10 October. The
launch saw one engine fail,
but the craft reached orbit.
SpaceX, based in Hawthorne,
said that, with nine engines, its
Falcon 9 rocket was designed
to handle such a problem.
See go.nature.com/rvdn4f for
more.


TREND WATCH

An analysis of agricultural
potential in 12 African countries,
released on 9 October, suggests
that farmers are making use of
just 10-25% of the land where
wheat can be grown profitably
without irrigation. Africa imports
wheat because production lags
behind consumption (see chart),
but on the basis of modelling
by the International Maize and
Wheat Improvement Center
in Texcoco, Mexico, the report
suggests that farmers could boost
domestic yields even in the face of
global warming.


PEOPLE
Fraud funding probe

Diederik Stapel, a social
psychologist formerly based
at Tilburg University in
the Netherlands, is under
investigation by Dutch
authorities, according to
newspaper reports. Stapel
resigned from his post last
year after he was found to
have fabricated data in some
30 published papers (see
Nature 479, 15; 2011). The
new investigation will seek
to determine whether Stapel
defrauded the government
by misappropriating grant
money.


Nobel prizes 

This year’s Nobel Prize in
Physiology or Medicine
went to stem-cell experts
John Gurdon and Shinya
Yamanaka, for their work on
reprogramming mature cells
into their embryonic state. The
physics prize was won by Serge 
Haroche and David Wineland, 
for their experiments in 
quantum optics. See pages 151 
and 152 for more. Nature went 
to press before the chemistry 
prize was awarded, but full 
details will be available at 
go.nature.com/5yjkul. 


Genius grants 

The MacArthur Foundation, 
headquartered in Chicago, 
Illinois, selected its fellows 
for 2012 on 1 October. 
Fellowships are worth
US$500,000 over 5 years,
and this year, 10 out of 23
recipients are scientists,
including microbiologist
Sarkis Mazmanian,
astronomer Olivier Guyon
and marine ecologist Nancy
Rabalais. The awards,
popularly known as genius
grants, come with no strings
attached as to how the money
is spent. See
go.nature.com/ru2vgy for more.


Chemical spill 


South Korea's government
has designated the area
around a chemical spill in the
southeastern city of Gumi
a special disaster zone. The
8 October announcement
came nearly 2 weeks after an
explosion at the Hube Globe
chemical plant released some
8 tonnes of hydrofluoric acid.
Five workers were killed,
according to the Yonhap
news agency. More than 3,000
people have since been treated
after inhaling acid fumes, and
the leak has damaged crops
and livestock. See
go.nature.com/yqjpnf for more.


AFRICAN WHEAT PRODUCTION LOAFING
Wheat consumption in Africa has increasingly outstripped
domestic production of the grain over the past 50 years.


13-17 OCTOBER
The Society for
Neuroscience meets in
New Orleans, Louisiana.
Featured topics include
the changing ecosystem
of global neuroscience,
with collaborative
efforts and ‘big data’
coming to the fore.
www.sfn.org/am2012


14-19 OCTOBER
New results from
the Curiosity rover
on Mars, and from
the Kepler mission
searching for extrasolar
planets, are announced
at the meeting of the
American Astronomical
Society's Division for
Planetary Sciences in
Reno, Nevada.
www.psi.edu/dpst2


RESEARCH
Telescope array

One of the world’s most
powerful radio-telescope
arrays, the Australian
Square Kilometre Array
Pathfinder, was officially
opened on 5 October at the
Murchison Radio-astronomy
Observatory in Western
Australia. Composed of
36 antennas, each 12 metres
in diameter, the telescope
will map black holes and take
a census of local galaxies, as
well as testing out technology
for a larger project in which
it is due to be involved: the
Square Kilometre Array,
split between Australia and
South Africa.


A year in space 

Two astronauts — one
American and one Russian —
will stay on the International
Space Station for an entire
year in a mission beginning
in spring 2015, NASA said
on 5 October. Space-station
missions are usually restricted
to six months. The mission
will collect more data about
how humans react to long
stays in space. But one year
is not a record: Russian
cosmonaut Valery Polyakov
spent 437 days in space on the
Mir space station in 1994-95,

Cell rewind wins medicine Nobel


Researchers awarded prestigious prize for their work on reprogramming mature cells to a 
pluripotent state. 


Alison Abbott 


08 October 2012 


The discovery that cells can be reprogrammed to an 
embryonic-like state has won this year’s Nobel Prize in 
Physiology or Medicine for two leading lights of stem- 
cell research: John Gurdon and Shinya Yamanaka. 


Reprogrammed cells regain pluripotency, the potential
to differentiate into many mature cell types. Many
researchers hope that cells created in this way will
eventually be used in regenerative medicine, providing
replacement tissue for damaged or diseased organs.
The field has become one of the hottest in biology, but
the prizewinners’ discoveries were not without
controversy when they were made.

John Gurdon (left) and Shinya Yamanaka showed how to reprogram cells into their embryonic states.


Gurdon, who is based at the Gurdon Institute in Cambridge, UK, was the first person to demonstrate that 
cells could be reprogrammed, in work published 50 years ago. At the time, scientists believed that
cellular specialization was a one-way process that could not be reversed. Gurdon overturned that dogma
by removing the nucleus from a frog egg cell and replacing it with the nucleus from a tadpole’s intestinal
cell. Remarkably, the process was able to turn back the cellular clock of the substitute nucleus. Although 
it had already committed to specialization, inside the egg cell it acted like an egg’s nucleus and directed 
the development of a normal tadpole. 


Gurdon was a graduate student at the University of Oxford, UK, when he did the work. He received his
doctorate in 1960 and went on to do a postdoc at the California Institute of Technology in Pasadena,
leaving his frogs in Europe. He did not publish the research until two years after he got his PhD, once
he was sure that the animals had matured healthily. “I was a graduate student flying in the face of
[established] knowledge,” he says. “There was a lot of scepticism.”


Mammalian cells did not prove as amenable to this process, known as cloning by nuclear transfer, as
frog cells. It was nearly 35 years before the first cloned mammal — Dolly the sheep — was born, in
1996. Dolly was the only live birth from 277 attempts, and mammalian cloning remained a hit-and-miss
affair.


Scientists were desperate to improve the efficiency of the system and to understand the exact molecular 
process involved. That is where Shinya Yamanaka of Kyoto University, Japan, made his mark. Yamanaka 
— who was born the year that Gurdon published his formative paper — used cultured mouse cells to 
identify the genes that kept embryonic cells immature, and then tested whether any of these genes could 
reprogram mature cells to make them pluripotent. 


In the mid-2000s, the stem-cell community knew that Yamanaka was close. “I remember when he 
presented the data at a 2006 Keystone symposium,” says Cédric Blanpain, a stem-cell biologist at the 
Free University of Brussels. “At that time he didn’t name them and everyone was betting what these 
magic factors could be.” 


A few months later, attendees at the 2006 meeting of the International Society for Stem Cell Research in 
Toronto, Canada, packed out Yamanaka’s lecture. The audience waited in silence before he announced 
his surprisingly simple recipe: activating just four genes was enough to turn adult cells called fibroblasts 
back into pluripotent stem cells. Such induced pluripotent stem (iPS) cells could then be coaxed into
different mature cell types, including nerve and heart cells.


Gurdon will be 80 next year, but he continues his laboratory work on the molecular basis of 
reprogramming in frogs. With his mop of floppy hair and wry sense of humour, he is regarded by 
colleagues as a typical English gentleman, running his research institute with friendly collegiality. Gurdon 
sometimes notes that the honour of having an institute named after him — it was previously known as the 
Wellcome Trust/Cancer Research UK Institute — is usually accorded only to the dead, something 
colleagues can only smile at. “John is very much the active scientist,” says Azim Surani, a principal 
investigator at the institute. 


Yamanaka, who has just turned 50, is equally well-regarded by colleagues, who describe him as smartly 
dressed, polite and meticulous. In an interview with the Nobel Foundation, based in Stockholm, he said 
that the phone call informing him that he had won the award had interrupted him as he was cleaning the 
house. Yamanaka’s research has won huge backing from the Japanese government, which now funds a 
large research centre for him at his university and has agreed to support a stem-cell bank for clinical
use. Yamanaka began his career as a surgeon, but, he said, “I had no talent for it, so I decided to
change my career from clinics to laboratories”. “But I still feel that I am a physician — my goal, all my life,
has been to bring stem-cell technologies to clinics.”


Both scientists are aware that translating their discoveries into regenerative therapies will take its own 
time. “That’s why it is so important to support basic science — it often happens that therapeutic benefit 
comes quite a long time after the initial discovery,” Gurdon told the Nobel Foundation. 


Nature 490, 151-152 (09 October 2012)  doi:10.1038/490151a 

Physics Nobel for quantum optics


Award for methods that ‘revolutionized’ atomic physics. 


Geoff Brumfiel 


09 October 2012 


As delicate as gossamer, the quantum properties of 
particles are apt to disappear as soon as physicists try 
to measure them. But it is possible to build a window on 
the quantum world to reveal these properties: and for 
that, Serge Haroche of the College of France, Paris, 
and David Wineland of the National Institute of
Standards and Technology in Boulder, Colorado, have
been awarded an equal share of this year’s Nobel Prize
in Physics.

Serge Haroche (left) and David Wineland discovered ways to probe the quantum states of particles.

Haroche uses atoms as a sensitive probe of light particles trapped in a cavity, whereas Wineland takes
the opposite approach, using light to measure the quantum states of atoms. Both techniques have
helped to investigate the fundamentals of quantum mechanics, and they are helping to develop new
technologies such as quantum computers or atomic clocks of dizzying precision. News of the award came
as a shock to Haroche: “I recognized the Swedish phone code. I had to sit down,” he said at a press
conference shortly after the announcement.


In the quantum world, particles of light and matter obey strange rules. One particle can occupy several 
mutually exclusive states simultaneously, for example, and groups of particles can be mysteriously 
connected through a process known as entanglement. But these quantum properties are hard to see: 
particles will show their quantum nature only in isolation, and even the slightest bump from the outside 
world will destroy their quantum states. That makes experiments extremely tricky, because the act of 
measuring itself is enough to upset the system. The techniques developed by Wineland and Haroche 
gave physicists a way to probe these states without destroying them. 


Haroche’s experiments bounce microwave photons between a pair of superconducting mirrors, and send a
stream of rubidium atoms through the fog of photons. By measuring the spins of the atoms as they enter
and exit the mirrored cavity, he is able to indirectly probe the quantum properties of the microwave
photons inside. Progressive measurements have, for example, allowed his team to observe a photon’s
quantum wavefunction — which simultaneously describes all of its possible quantum states — and then
monitor its collapse to a single, well-defined state.

Wineland’s group traps beryllium ions in electric fields, and cools them 
with a laser that excites the ion’s electrons. This sucks vibrational energy from the system, lowering the 
temperature. Researchers can then use lasers to alter vibrations between the ions, allowing them to 
control the quantum interactions in the system. The work is already being used to build atomic clocks
with unprecedented accuracy, says Immanuel Bloch, a physicist at the Max Planck Institute for Quantum 
Optics in Garching, Germany. Further down the line, these techniques could be used in a quantum 
computer — a device that can perform calculations using the probabilistic rules of quantum mechanics. 


The award is “a great choice of two people who have really contributed to the foundations of quantum
physics”, Bloch says. He notes that this is just the latest in a run of Nobel prizes for quantum optics. Bloch 
thinks that this is down to the myriad techniques, such as those of Wineland and Haroche, that are 
allowing researchers to isolate, study and manipulate increasingly complex quantum systems. “I think
we’ve really seen atomic physics revolutionized,” he says.

Annie Dookhan, a former forensic chemist in Massachusetts, has been charged with obstructing justice.

Boston scandal
exposes backlog


Chemist charged with fraud shows system under pressure. 


BY EUGENIE SAMUEL REICH 


The numbers are shocking: 1,100 people
are in jail on the basis of evidence that
may have been tampered with, and
34,000 criminal cases could be affected. The
fallout stems from the work of just one forensic
chemist, who is alleged to have faked test
results on suspected drug samples.

The revelations and the subsequent arrest 
on 28 September of Annie Dookhan, who 
worked at a state-run laboratory in Boston, 
Massachusetts, have rattled the US foren- 
sics community and raised questions about 
oversight across the profession. Nature has 
learned that the facility had applied for fed- 
eral funding to help to clear a backlog of some 
8,000 cases — a full year’s work. The situation 
parallels that of overburdened forensic labs 
across the United States. 

“If you think this couldn't happen in your
lab, you're naive,” says Robin Cotton, director
of the forensic-sciences programme at Boston
University and a former forensic-lab director.
“Lab directors all over the country are paying
attention.” Cotton and others say that the
affair highlights a resourcing problem that is
putting pressure on US forensic-lab workers
and their supervisors, who need to bring
cases to court fast, but without compromising
the integrity of test results.

Dookhan has pleaded not guilty to charges
of obstructing justice by falsifying data in two
criminal cases, and of lying under oath about
her qualifications. However, a wide-ranging
confession by Dookhan documented in a
police report published by The Boston Globe
implies that she falsified results on numerous
occasions over the past 2-3 years. In most
cases, she had guessed the nature of samples
she had been asked to test; but “a few times”
she had recorded the results of negative drug
tests as being positive. Because Dookhan
wasn't able to tell police which samples she
had faked, and has said that she sometimes
contaminated samples after the fact so that
they would conform to her guesses, the possibility
of wrongful conviction now swirls
around the tens of thousands of cases on
which she worked. Those who are currently
serving jail terms are set to have their cases
reviewed in court hearings that will begin on
15 October.

SYSTEM OVERLOAD
Demands on US forensic labs have expanded —
and so has the proportion of backlogged cases.

Dookhan’s role at the lab was to identify 
compounds found by police that were sus- 
pected to be drugs. Police interviews with 
her former colleagues reveal that she tested 
more samples in less time than any other 
chemist in the lab — a record of productivity 
that raised eyebrows and had generated com- 
plaints from co-workers. In one instance, a top 
official reacted to concerns by giving Dookhan 
a special project to try to “slow her down”. 

Dookhan resigned in March while facing 
termination proceedings. The director of ana- 
lytical chemistry has been fired, the lab bureau 
chief has resigned and Dookhan’s direct 
supervisor faces disciplinary proceedings. 
Massachusetts public-health commissioner 
John Auerbach has also resigned, taking 
responsibility for the loss of confidence in the 
state's forensic work as a result of the scandal. 


RESOURCE PROBLEM 

Forensic labs in the United States are under a 
great deal of pressure to keep up with increas- 
ing case loads. In Massachusetts, for example, 
Nature has found that the National Institute of 
Justice (NIJ) in Washington DC allocated the 
state police at least US$1.2 million from 2009 
to 2011 to try to clear backlogs, in part by
hiring an extra drug chemist and by buying 
new equipment. A spokeswoman for the NIJ 
has confirmed that some of the funding was 
routed to the lab in which Dookhan worked. 
The grants came with a requirement to pro- 
vide oversight and report on any allegations 
or suspicions of wrongdoing that might affect 
the integrity of forensic science, but the NIJ 
says that nothing had been reported by state 
police by the end of 2011. 

Censuses of US forensic labs show that as 
many as one in four requests by law enforce- 
ment for forensic work are not acted on within 
30 days. That has led to an ever-increasing 
backlog that accounts for a growing propor- 
tion of the workload (see ‘System overload’). 
The backlog is worst for DNA testing, which 
has become a major focus for groups interested 
in wrongful convictions. However, with drug 
law enforcement a priority in the US justice 
system, labs that test drugs are also being
swamped with cases.

Interviews documented in the police report 
suggest that a key cause of the backlogging at 
the lab is a 2009 US Supreme Court decision 
known as Melendez-Diaz v. Massachusetts, 
which overturned a drug conviction because 
the defence had been refused permission to 
cross-examine the forensic scientists who 
tested samples seized from a suspect. The 
result means that defence attorneys are now 
more likely to call forensic scientists into 
court to testify. “They're spending all their
time in the courtroom and not the laboratory,”
says Ralph Keaton, executive director of the 
American Association of Crime Laboratory 
Directors Laboratory Accreditation Board in 
Garner, North Carolina. “Then the backlog 
grows.” 

A bill now before the US Congress aims to 
improve the quality of forensic science by pro- 
viding funding for research and for develop- 
ment of national standards. But the bill does 
not include funding to clear backlogs. Stephen 
Saloom, policy director at the Innocence
Project in New York, which seeks to uncover
wrongful convictions, says that even if the bill 
passes it won't provide an immediate remedy 
for cases of deliberate evidence tampering. 

Dookhan’s attorney, Nicolas Gordon, says
that his client is not speaking to the media. 
Dookhan has been released on bail until 
a court hearing on 3 December. Gordon 
acknowledges allegations against his client but 
won't comment on their veracity. “It’s a fluid 
situation that could change over the next few 
months,” he notes.


INTERDISCIPLINARY RESEARCH 


Economics and genetics
meet in uneasy union


Use of population-genetic data to predict economic success sparks war of words.


BY EWEN CALLAWAY
“The invalid assumption that
correlation implies cause is probably 
among the two or three most serious 
and common errors of human reasoning.” 
Evolutionary biologist Stephen Jay Gould was 
referring to purported links between genetics 
and an individual’s intelligence when he made 
this familiar complaint in his 1981 book The 
Mismeasure of Man. 

The United States has the right amount of genetic diversity to buoy its economy, claim economists.


Fast-forward three decades, and leading 
geneticists and anthropologists are levelling a 
similar charge at economics researchers who 
claim that a country’s genetic diversity can 
predict the success of its economy. To critics, 
the economists’ paper seems to suggest that 
a country’s poverty could be the result of its 
citizens’ genetic make-up, and the paper is 
attracting charges of genetic determinism, 
and even racism. But the economists say that 
they have been misunderstood, and are merely 
using genetics as a proxy for other factors
that can drive an economy, such as history 
and culture. The debate holds cautionary les- 
sons for a nascent field that blends genetics with 
economics, sometimes called genoeconomics. 
The work could have real-world pay-offs, such 
as helping policy-makers to set the right level of 
immigration to boost the economy, says Enrico 
Spolaore, an economist at Tufts University near 
Boston, Massachusetts, who has also used 
global genetic-diversity data in his research. 

But the economists at the forefront of this 
field clearly need to be prepared for harsh 
scrutiny of their techniques and conclusions. 
At the centre of the storm is a 107-page paper 
by Oded Galor of Brown University in Providence,
Rhode Island, and Quamrul Ashraf of
Williams College in Williamstown, Massachusetts.
It has been peer-reviewed by economists
and biologists, and will soon appear in
American Economic Review, one of the most 
prestigious economics journals. 

The paper argues that there are strong links 
between estimates of genetic diversity for 
145 countries and per-capita incomes, even 
after accounting for myriad factors such as eco- 
nomic-based migration. High genetic diver- 
sity in a country’s population is linked with 
greater innovation, the paper says, because 
diverse populations have a greater range of 
cognitive abilities and styles. By contrast, low 
genetic diversity tends to produce societies 
with greater interpersonal trust, because there 
are fewer differences between populations. 
Countries with intermediate levels of diversity, 
such as the United States, balance these factors 
and have the most productive economies as a 
result, the economists conclude. 

The manuscript had been circulating on the
Internet for more than two years, garnering 
little attention outside economics — until last 
month, when Science published a summary 
of the paper in its section on new research in 
other journals. This sparked a sharp response 
from a long list of prominent scientists, 
including geneticist David Reich of Harvard 
Medical School in Boston, Massachusetts, and 
Harvard University palaeoanthropologist 
Daniel Lieberman in Cambridge. 

In an open letter, the group said that it is 
worried about the political implications of the 
economists’ work: “the suggestion that an ideal 
level of genetic variation could foster economic 
growth and could even be engineered has the 
potential to be misused with frightening consequences
to justify indefensible practices such
as ethnic cleansing or genocide,” it said.

The critics add that the economists made 
blunders such as treating the genetic diversity 
of different countries as independent data, 
when they are intrinsically linked by human 
migration and shared history. “It’s a misuse 
of data,” says Reich, which undermines the 
paper’s main conclusions. The populations of 
East Asian countries share a common genetic 
history, and cultural practices — but the 
former is not necessarily responsible for 
the latter. “Such haphazard methods and 
erroneous assumptions of statistical independ- 
ence could equally find a genetic cause for 
the use of chopsticks,” the critics wrote. 

They have missed the point, responds Galor, 
a prominent economist whose work examines 
the ancient origins of contemporary economic 
factors. “The entire criticism is based on a 
gross misinterpretation of our work and, in 
some respects, a superficial understanding of 
the empirical techniques employed,” he says. 
Galor and Ashraf told Nature that, far from 
claiming that genetic diversity directly influ- 
ences economic development, they are using 
it as a proxy for immeasurable cultural, his- 
torical and biological factors that influence 
economies. “Our study is not about a nature 
or nurture debate,” says Ashraf.

“It seems like the devil is in the interpretation
more than the actual application of
the statistics,” says Sohini Ramachandran, a 
population geneticist at Brown University who 
provided the genetic data for the study. She 
adds that Galor and Ashraf used estimates of 
genetic diversity that she and her colleagues
specifically developed to overcome many 
of the confounding factors caused by the 
overlapping genetic and cultural histories of 
neighbouring countries. 

Galor and Ashraf are not the first economists 
to use genetic-diversity data. Spolaore has also 
found that the differences in genetic diversity 
between countries can predict discrepancies in 
their level of economic development. But he
is clear that this is not necessarily a causal relationship:
“In my view it’s not genetic diversity
itself that is responsible for this correlation,” he
says. “A lot of this could be culture.”

Some say that the field needs a dose of
rigour. Many studies linking genetic variation
to economic traits make basic methodological
errors, says Daniel Benjamin, a behavioural economist
at Cornell University in Ithaca, New
York. He is part of the Social Science Genetics
Association Consortium, a group that
brings together social scientists, epidemiologists
and geneticists to improve such studies.
Problems that medical geneticists have known
about for years — such as those stemming
from small sample sizes — crop up all too
often when economists start to work with
the data, he says.

For instance, while searching for genetic
associations with factors such as happiness
and income in a study of 2,349 Icelanders,
Benjamin and his colleagues found a statistically
significant association between
educational attainment and a variant in a gene
involved in breaking down a neurotransmitter
molecule. But the researchers could not
replicate this association in three other population
samples — a test for false positives that
is standard practice in medical genetics — and
the team now has reservations about the association.
If the field is to develop fruitfully, “I think
it’s essential for us to have geneticists involved”,
says Benjamin. “We couldn’t do it without their
help and insight.” SEE EDITORIAL P.144

Genome interpreter vies for
place in clinical market 


Launch of system that keeps data local aims to address privacy fears. 


BY MONYA BAKER 


As DNA sequencing gets faster and
cheaper, clinicians are clamouring to
use it. A test for malfunctioning genes
might show how to treat a tumour or help to 
diagnose the underlying causes of a disease. 
But sequencing data are too complex for most 
clinicians to analyse, and medical institutions 
are wary of transferring patient data to special- 
ists elsewhere for analysis. 

A genome-interpretation company is now 
offering its solution: a 1-metre-tall, 275-kilo- 
gram black box that carries enough storage and 
processing power to analyse one genome every 
day, picking out mutations with potential links 
to disease — in theory, fast enough to inform 
treatment. But for some, the most important 
feature of the US$125,000 unit is that it is a self- 
contained object. In an era of cloud comput- 
ing and global networks, a machine that keeps 
its information stubbornly local has growing 
appeal. “There is a tremendous worry about
privacy with sharing patient data,” says Martin
Tolar, chief executive of Knome, the company
in Cambridge, Massachusetts, that produces
the device. “The institutions we approached
said, ‘We want to keep the system within our
four walls.’”

Unveiled on 27 September, the Knome sys- 
tem, knoSYS 100, belongs to an emerging class 
of services and tools to help clinical researchers 
to catch up with advances in genome sequencing.
That capability “has made itself available
faster than we are prepared to deal with”, says
Vincent Funari, director of the genomics core 
facility at Cedars-Sinai Medical Center in Los 
Angeles, California. 

The core of Knome’s system is not hardware,
but software. The machine combs through a 
newly sequenced genome to find places where 
humans vary, and annotates them with existing 
knowledge. This process, known as genome 
interpretation, can winnow down the millions 
of variants found in any individual’s genome 
to a handful that might explain a disease (see
‘Super sifter’). “Our goal is to take these data
and say, ‘For this group of patients, these are the
5-10 variants that are most likely to be
implicated,’” explains Tolar.

For now, the software is not meant for clinical
diagnosis or medical advice. Clients include drug
companies and medical centres researching how to
use sequencing for clinical decisions.

SUPER SIFTER

Genome-interpreting tools work by discarding genetic differences that look inconsequential or lack
information, and focusing on a handful likely to be linked to disease. (Schematic is simplified; tools vary.)

INPUT: Sequenced genome (~6 billion base pairs)

Variants identified (compare with reference genome and known variation): 3 million to 5 million variants

Filter for variants in genes: >20,000 variants

Compare with genetic databases and genomes of other ill or healthy people to select rare variants and those associated with disease: ~1,000 variants

If no known causative variant found, run through algorithms to predict variants likely to disrupt gene function: ~100 variants

Use existing knowledge to find variants most likely to cause disease: ~10 variants

OUTPUT: For follow-up analysis (~10 variants)
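The filtering funnel in the ‘Super sifter’ schematic can be sketched in a few lines of Python. This is only an illustration of the general idea, not Knome’s software: the `Variant` fields, the `sift` function, the 1% rarity cutoff and the `evidence_score` ranking are all hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    """One difference from the reference genome (all fields are illustrative)."""
    name: str
    in_gene: bool                # does the variant fall inside a gene?
    population_frequency: float  # share of reference genomes carrying it
    known_causative: bool        # already linked to the disease in a database
    predicted_disruptive: bool   # flagged by a function-prediction algorithm
    evidence_score: float        # hypothetical ranking from existing knowledge


def sift(variants, rare_below=0.01, shortlist_size=10):
    """Narrow a list of variants to a shortlist, mirroring the schematic's steps."""
    # Keep only variants that fall inside genes.
    in_genes = [v for v in variants if v.in_gene]
    # Keep rare variants and those already associated with disease.
    candidates = [v for v in in_genes
                  if v.known_causative or v.population_frequency < rare_below]
    # Only if no known causative variant exists, fall back to prediction.
    if not any(v.known_causative for v in candidates):
        candidates = [v for v in candidates if v.predicted_disruptive]
    # Rank by existing knowledge and keep the top few for follow-up.
    ranked = sorted(candidates, key=lambda v: v.evidence_score, reverse=True)
    return ranked[:shortlist_size]


example = [
    Variant("v1", True, 0.30, False, False, 0.1),   # common variant
    Variant("v2", False, 0.001, False, True, 0.9),  # lies outside genes
    Variant("v3", True, 0.002, False, True, 0.7),   # rare, predicted disruptive
    Variant("v4", True, 0.004, False, False, 0.5),  # rare, predicted benign
]
shortlist = sift(example)
print([v.name for v in shortlist])
```

Each stage only ever discards variants, which is why the counts in the schematic shrink at every step; a real tool would draw each decision from curated databases rather than from precomputed flags.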

Experts warn that genome interpretation is 
far from mature, and that its reliability depends 
on the quality of the sequences it analyses. Not 
all types of variants can be detected, and errors 
occur at every step before and during interpre- 
tation: in the sequencing of fragments of DNA; 
in matching those millions of fragments with 
their equivalents in the reference genome; and 
in detecting differences. “The interpretation of
variants is absolutely dependent on accurate
variant identification,” says Karl Voelkerding,
medical director for genomics and bioinfor- 
matics at ARUP Laboratories, an assay facility 
in Salt Lake City, Utah. 

Then there is the challenge of working out 
which detected variants are relevant to disease. 
The protocols are imperfect and the various 
annotation tools access different data in dif- 
ferent ways and so supply a variety of answers. 
And all annotation tools uncover many ‘variants
of unknown significance’, about which too
little is known to assess whether or not they
affect a person’s health. Any variant that might
be used to provide a diagnosis or guide patient 
care must be verified independently by sepa- 
rate experiments. 

But just organizing information into a useful 
form is a big step forward, says George Church, 
a genomicist at Harvard Medical School in 
Boston, Massachusetts, and co-founder of 
Knome. The process, he says, “is not about
perfection. It’s about delivering a high-quality
interpretation based on current knowledge.”

Knome’s device may be well placed to tap a 
nascent clinical market in which data is pref- 
erentially kept on site, but sequencing com- 
panies are also making inroads with software 
that requires genomic data to be transferred 
elsewhere. For example, Illumina, based in San 
Diego, California, offers free data storage and 
variant identification for clients who upload 
sequencing data to its cloud-computing plat- 
form, which has an open programming interface.
Illumina has contracts with a suite of other
companies to develop data-analytic applica- 
tions in the cloud. As more centres begin using 
sequencing data, it is expected that policies and 
procedures for using the cloud will mature. 

David Goldstein, a genomicist at Duke 
Institute for Genome Sciences and Policy in 
Durham, North Carolina, says that although 
commercial genome interpreters may be 
valuable in the clinic, he doubts that they will 
make headway in the larger genomics research 
centres. Commercial offerings rely on stand- 
ard references and techniques that can make 
their approach too constraining for genomics 
research, he says. “We’ve never had a 12-month
stretch where we’ve done something the same
way.”

But for clinicians interested more in working 
out how best to apply current genomics knowl- 
edge, prepackaged software and hardware may 
be just the ticket. “Last year we had zero clinical
institutes” as clients, says Tolar, “and this
year we have 20.” ■ SEE EDITORIAL P.143

Hyped GM maize study
faces growing scrutiny 


Food-safety bodies slam feeding study that claims increased cancer incidence in rats. 


BY DECLAN BUTLER 


The storm of scientific criticism over
claims that a genetically modified 
(GM) maize causes severe disease in 
rats shows no signs of abating. 

Gilles-Eric Séralini, a molecular biologist at 
the University of Caen, France, is under 
intense pressure to report the full data 
behind his team’s finding that rats fed for 
two years with Monsanto’s glyphosate- 
resistant NK603 maize (corn) developed 
many more tumours and died earlier 
than controls (see Nature 489, 484; 2012). 
The study, run in collaboration with the 
Paris-based Committee for Research and 
Independent Information on Genetic 
Engineering (CRIIGEN), also found that 
rats developed tumours when their drink- 
ing water was spiked with glyphosate, 
the herbicide that is used with the GM 
maize. The findings have had a huge pub- 
lic impact in Europe, empowering those 
opposed more broadly to GM foods, and 
leading some politicians to call for tighter 
regulations or outright bans of the maize. 

Last week, the European Food Safety 
Authority (EFSA) in Parma, Italy, and 
Germany’s Federal Institute for Risk 
Assessment (BfR) in Berlin both issued 
initial assessments slamming the paper, bluntly 
asserting that its conclusions are not supported 
by the data presented. “The design, reporting 
and analysis of the study, as outlined in the 
paper, are inadequate,” says the EFSA in a press
release, adding that the paper is “of insufficient 
scientific quality to be considered as valid for 
risk assessment”. 

The biggest criticism from both reviews is 
that Séralini and his team used only ten rats 
of each sex in their treatment groups. That is 
a similar number of rats per group to that used 
in most previous toxicity tests of GM foods, 
including Missouri-based Monsanto's own 
tests of NK603 maize. Such regulatory tests 
monitor rats for 90 days, and guidelines from 
the Organisation for Economic Co-operation 
and Development (OECD) state that ten rats of 
each sex per group over that time span is suf- 
ficient because the rats are relatively young. But 
Séralini’s study was over two years — almost a 
rat’s lifespan — and for tests of this duration, the 
OECD recommends at least 20 rats of each sex
per group for chemical-toxicity studies, and at
least 50 for carcinogenicity studies.

Gilles-Eric Séralini’s book describes his latest GM research.

Moreover, the study used Sprague-Dawley
rats, which both reviews note are prone to developing
spontaneous tumours. Data provided to
Nature by Harlan Laboratories, which supplied
the rats in the study, show that only one-third
of males, and less than one-half of females, live
to 104 weeks. By comparison, its Han Wistar 
rats have greater than 70% survival at 104 weeks, 
and fewer tumours. OECD guidelines state that 
for two-year experiments, rats should have a 
survival rate of at least 50% at 104 weeks. If they
do not, each treatment group should include 
even more animals — 65 or more of each sex. 
“There is a high probability that the findings 
in relation to the tumour incidence are due 
to chance, given the low number of animals 
and the spontaneous occurrence of tumours 
in Sprague-Dawley rats,” concludes the EFSA
report.

In response to the EFSA’s assessment, the
European Federation of Biotechnology — an 
umbrella body in Barcelona, Spain, that repre- 
sents biotech researchers, institutes and compa- 
nies across Europe — called for the study to be 
retracted, describing its publication as a “dangerous
case of failure of the peer-review system”.

Séralini argues that a battery of observations
in the study reinforces his tumour-incidence 
and mortality claims. “Of course, this should
be replicated by others, but we believe in these
results,” he says. He agrees that more rats would 
have boosted his study’s statistical power, but 
says that he did not design the experiment to 
show differences in tumour incidences, because 
he was not expecting to find any — no previous 
tests on GM foods had suggested a cancer risk. 
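
The sample-size objection raised by the reviews can be made concrete with a small calculation. The sketch below is an illustration only: it assumes a hypothetical spontaneous tumour rate of 50% (a figure not given by the EFSA or the BfR) and asks how often two groups with no treatment difference would still differ by 30 percentage points purely by chance, for groups of 10 and of 50 animals.

```python
from math import comb


def binomial_pmf(k, n, p):
    """Probability of exactly k tumour-bearing animals in a group of n."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def chance_of_gap(n, p, gap):
    """P(|X - Y| >= gap) for two independent groups, each Binomial(n, p)."""
    pmf = [binomial_pmf(k, n, p) for k in range(n + 1)]
    return sum(
        pmf[x] * pmf[y]
        for x in range(n + 1)
        for y in range(n + 1)
        if abs(x - y) >= gap
    )


ASSUMED_RATE = 0.5  # hypothetical spontaneous tumour rate, not from the reviews

small_groups = chance_of_gap(10, ASSUMED_RATE, 3)   # 3 of 10 animals = 30 points
large_groups = chance_of_gap(50, ASSUMED_RATE, 15)  # 15 of 50 animals = 30 points
print(f"10 per group: {small_groups:.3f}")
print(f"50 per group: {large_groups:.3f}")
```

Under these assumptions, groups of ten differ by that margin roughly a quarter of the time with no treatment effect at all, whereas with 50 animals per group the same gap is rare. The exact figures depend on the assumed rate.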

Yet Séralini has promoted the can- 
cer results as the study’s major finding, 
through a tightly orchestrated media 
offensive that began last month and 
included the release of a book and a film 
about the work. Only a select group of 
journalists (not including Nature) was 
given access to the embargoed paper, and 
each writer was required to sign a highly 
unusual confidentiality agreement, seen 
by Nature, which prevented them from 
discussing the paper with other scientists 
before the embargo expired. 

Journalists often receive embargoed 
journal articles, and standard practice is 
to solicit independent assessments before 
the paper is published. The agreement for 
this paper, however, did not allow any dis- 
closure and threatened a severe penalty for 
non-compliance: “A refund of the cost of 
the study of several million euros would be 
considered damages if the premature dis- 
closure questioned the release of the study.”

In an exceptional move, the ethics commit- 
tee of the French National Centre for Scien- 
tific Research (CNRS) last week decried the 
public-relations offensive as inappropriate for 
a high-quality and objective scientific debate, 
and reminded researchers working on con- 
troversial topics of the need to report results 
responsibly to the public. 

Meanwhile, Séralini says that he won't make 
any data available to the EFSA and the BfR 
until the EFSA makes public all the data under- 
pinning its 2003 approval of NK603 maize for 
human consumption and animal feed. He has 
also criticized the EFSA, and most other detrac- 
tors of his study, for alleged conflicts of inter- 
est, claiming that he is “being attacked in an 
extremely dishonest fashion by lobbies passing 
themselves off as the scientific community”. 

The journal that published his study, Food 
and Chemical Toxicology, said last week in a 
statement that it “welcomes any and all ‘Letters 
to the Editor’ that have questions and concerns 
about this paper”. ■

US biologists decry
funding changes 


National Science Foundation policy may harm tenure seekers. 


BY HELEN SHEN 


“This is the farthest I think that I’ve
stuck my neck out,” says Sarah
Hobbie, an ecologist at the Univer- 
sity of Minnesota in Saint Paul. Policy changes 
at the US National Science Foundation (NSF) 
are giving life scientists in some fields fewer 
opportunities to apply for funding, so Hobbie 
is now at the forefront of a campaign to per- 
suade the NSF to rethink its approach. 

In August, Hobbie garnered more than 550 
signatures with an open letter to the NSF that 
says the changes, put in place last year, are 
jeopardizing the careers of young scientists. 
She has also surveyed the ecology commu- 
nity, which relies heavily on NSF funding, 
about what it thought was the best way for- 
ward for the agency. Of the 1,622 responses 
she received, only 13% favoured the NSF’s 
changes over an alternative devised by Hob- 
bie and her colleagues. 

At the heart of the dispute are changes to 
the NSF's application process for grants in 
organismal and environmental biology. The 
two divisions account for about half of the 
US$700 million that the agency will spend on 
the life sciences in 2012. In the past, research- 
ers had two chances to apply for funding each 
year. Now they have just one opportunity, in 
two stages. For the first stage, researchers sub- 
mit a short ‘pre-proposal’ in January, which 
is then screened by reviewers. Researchers 
who clear this hurdle are invited to
submit full proposals by 2 August. Grant
winners are awarded funding the following
year.


The NSF says 
that it expects to award roughly the same 
number of grants as in previous years, but 
the revised process will reduce the burden on 
grant reviewers. “The old system had reached 
breaking point,” says John Wingfield, who 
heads the biology directorate at the NSF in 
Arlington, Virginia. He adds that the change 
has “created some interest in other directo- 
rates”. 

Hobbie, whose two pre-proposals were 
turned down this year, must now wait until 
January to try again for the chance to submit
a full proposal in August. If approved,
that project would probably not receive
funding until 2014. She says that such long 
gaps between resubmissions have a domino 
effect because the delays to research and pub- 
lications can be career-limiting for younger 
researchers striving to establish themselves 
and earn tenure. 

“Instead of 10 or 12 opportunities to put in 
a proposal before tenure, it’s 5 or 6 now,” says
Jim Heffernan, an ecologist in his first year of 
an assistant professorship at Duke University 
in Durham, North Carolina. 

Another change, which prohibits research- 
ers from acting as principal investigators on 
more than two proposals a year, is equally 
unpopular. “Strategically, I need to be the 
lead principal investigator on whatever grant 
I get,” says Stan Harpole, an ecologist and 
assistant professor at Iowa State University 
in Ames, who is two years from his tenure 
evaluation. 


COMMON CONCERN 

Simmering criticisms of the new policies 
bubbled over in August at a panel discussion 
during the annual meeting of the Ecological 
Society of America in Portland, Oregon. It was 
there, Hobbie says, that she realized that her 
colleagues shared her concerns. The open let- 
ter she sent later that month led to a private 
meeting with Wingfield on 10 September. A 
week later, the heads of 33 life-sciences depart- 
ments and programme leaders from several US 
universities sent a letter to the NSF expressing 
similar worries. 

“With a single decision at the NSF, you’ve
made an abrupt change that will affect a lot of 
people,” says Kenneth Petren, head of biologi- 
cal sciences at the University of Cincinnati in 
Ohio, who signed the second letter. “I’m most 
concerned younger people will decide there’s 
a better career path for them.” 

On 3 October, Wingfield told Nature that 
he understands the concerns, but that the NSF 
has opted not to reverse its policy for now. 
Instead, he says, the agency will evaluate the 
changes over the next few grant cycles. Hobbie 
and Petren say that such monitoring should 
involve dialogue with affected researchers. 

“Engaging with the community is the only 
way they’re going to assess certain types of 
impacts on science,” says Hobbie. “I’m quite
disappointed that I haven’t heard anything in
that direction.” ■

A battlefield; an abusive parent; the ongoing
struggle to make ends meet; a pile-up
of unanswered e-mails: stress can take
many forms. But stressful situations have some- 
thing in common: they trigger reactions such as 
fear and a surge of hormones. These responses 
can be beneficial in the short term but, if severe 
or prolonged, can damage the mind, triggering 
conditions ranging from depression to post- 
traumatic stress disorder. That much has been 
known for decades; now researchers are getting 
to grips with how stress can alter the biology 
of the brain, and tip a mind into illness. Here, 
Nature takes a look at what they have learned, 
and at the gaps that remain. 

Modern life — with more people, more 
pressure and more gadgets — certainly 
seems more stressful than life in the past, but 
how does it affect the mind? Some scientists 
are seeking answers by examining how the 
brains of city- and country-dwellers process 
stressful situations (see page 162). Others are 
looking for the molecular scars left by stress. 
Neuroscientist Eric Nestler argues that stress 
may influence the brain through epigenetics, 
mechanisms that change how genes behave 
(see page 171). Biologists Elizabeth Blackburn 
and Elissa Epel, meanwhile, suggest that stress
causes chronic diseases in part by shortening
telomeres, structures that cap and protect the
ends of chromosomes (see page 169).

A special section of Nature traces the emerging links
between stress and mental illness.

Still others hope to draw lessons from people 
who can bounce back from stress — even 
from experiences as devastating as abduction 
and rape. By tracing the social and biological 
factors that help these people to recover, the 
research could yield ways to make the rest of 
us more resilient (see page 165). ■ SEE EDITORIAL
P.143 AND CAREERS P.299


STRESS AND RESILIENCE 


The links between adversity and 
mental illness. nature.com/stress

BY ALISON ABBOTT

In 1965, health authorities in Camberwell, a bustling quarter of
London’s southward sprawl, began an unusual tally. They started
to keep case records for every person in the area who was diagnosed
with schizophrenia, depression, bipolar disorder or any other psychiatric
condition. Decades later, when psychiatrists looked back across
the data, they saw a surprising trend: the incidence of schizophrenia
had more or less doubled, from around 11 per 100,000 inhabitants per
year in 1965 to 23 per 100,000 in 1997, a period when there was no
such rise in the general population (J. Boydell et al. Br. J. Psychiatry 182,
45-49; 2003). The result raised a question in many researchers’ minds:
could the stress of city life be increasing the risk of schizophrenia and [...]

The question is an urgent one. Back in 1950, less than one-third of the
world’s population lived in cities. Now, lured by the prospect of work and
opportunity, more than half do. Mental illnesses already comprise the
world’s biggest disease burden after infectious diseases and, although [...]
psychiatric ailments doubled between 2000 and 2010; in North America, up
to 40% of disability claims for work absence are related to depression,
according to some estimates. “It seems that cities may be making us
sick,” [...] the Institute of Psychiatry [...]

Anecdotally, the link between cities, stress and mental health makes
sense. Psychiatrists know that stress can trigger mental disorders, and
modern city life is widely perceived as stressful. City dwellers typically
face more noise, more crime, more slums and more people jostling on the
streets than do those outside urban areas. Those who have jobs complain
of growing demands on them in the workplace, where they are expected
to do much more in less time.


But the idea has not been widely tested. It is difficult to study whether
something as complex as a ‘city environment’ has an impact on the
brain. To complicate matters, many growing cities include immigrant 
populations, which already have an increased risk of psychiatric disease 
associated with social isolation. 

Now, a few scientists are tackling the question head on, using func- 
tional brain imaging and digital monitoring to see how people living in 
cities and rural areas differ in the way that their brains process stress- 
ful situations. “Yes, city-stress is a big, messy concept, but I believed 
it should be possible to at least see if brains of city-dwellers looked 
somehow different,” says Andreas Meyer-Lindenberg, director of the 
Central Institute for Mental Health in Mannheim, Germany. And if 
scientists can work out what aspects of the city are the most stressful, 
the findings might even help to improve the design of urban areas. 
“Everyone wants the city to be beautiful but no-one knows what that 
means,” says Meyer-Lindenberg. Wider streets? Taller buildings? More
trees? “Architects theorize a lot, but this type of project could deliver a 
scientific basis for a city code.” 


RELENTLESS STRESS 

Considered from an evolutionary standpoint, the physiological stress 
response is definitely a good thing: it helps mammals to survive. Any 
threat, whether from a predator, dwindling food supplies or an aggres- 
sive enemy, triggers release of hormones such as cortisol and adrena- 
line. These hormones raise levels of sugar in the blood and redistribute 
blood flow to muscles and lungs, so that animals can respond to the 
threat by running, hunting or fighting. 

Problems arise when the stress response doesn't switch off. Stress- 
hormone levels that stay too high for too long cause high blood pressure 
and suppress the immune system. And, although the mechanisms are 
unknown, scientists agree that severe or prolonged stress also raises the
risk of psychiatric disease — most brutally in those who have a genetic 
predisposition, and when the stress occurs while the brain is still devel- 
oping. In theory, then, the ceaseless challenges of the city could produce 
this kind of damaging stress. Some fear that they could end up driving 
an increase in mental illness around the world. 

The only signs of an increase, however, come from relatively small, 
local studies. “It’s frustrating; we feel it should be rising,” says Ronald
Kessler, a mental-health epidemiologist at Harvard Medical School in 
Boston, Massachusetts. “But globally we have not seen this, and there are 
also studies which indicate it isn’t even rising in cities.” However, reliable 
data on the prevalence of psychiatric disease are hard to find because 
diagnoses are often imprecise or incompletely recorded. The Camber- 
well study was influential because, unusually, it captured all those who 
were diagnosed with a mental disorder, even if they were not admitted 
to hospitals, and the researchers involved carefully reviewed every case. 

Published in 2003, the Camberwell study deeply impressed Meyer- 
Lindenberg, who was then at the US National Institute for Mental 
Health in Bethesda, Maryland, researching how genetic risk factors 
for schizophrenia affect brain function. As a student in Manhattan 
some years earlier, Meyer-Lindenberg says, “I had been struck by the 
number of homeless mentally ill people on the streets, and the problems 
of the city somehow resonated with me”. He wondered if city living was 
somehow making the brain more susceptible to mental-health condi- 
tions. When he returned to his native Germany in 2007, he decided to 
tackle the question directly. But at the time, Meyer-Lindenberg says, 
“people said the effect would be too subtle to make sense of”. 

Yet the results of his study, published last year in Nature (F. Lederbogen
et al. Nature 474, 498-501; 2011), clearly showed that people who grow 
up in cities process negative emotions such as stress differently from 
those who move to the city as adults. His team scanned the brains of 55 
healthy volunteers as they carried out arithmetic tasks under a constant 
bombardment of negative social feedback. “We’d always let them know
through headphones that we thought they were failing, or at least not
doing as well as other subjects we’d had in the scanner,” says
Meyer-Lindenberg. “In one set of experiments we let them see our impatient
faces on computer screens.”

Rush hour: phone apps are helping researchers to survey stress over the day.


This social stress activated two brain areas — but the pattern 
depended on the volunteers’ histories of urban living. The amygdala, 
which processes emotion, showed much greater activity in people who 
were currently living in a city. And the cingulate cortex, which helps 
to regulate the amygdala and processes negative emotion, responded 
more strongly in those brought up in large cities than in those brought 
up in the countryside, irrespective of where they lived now. Meyer- 
Lindenberg thinks that this over-responsiveness to stress could make 
city-dwellers more prone to psychiatric conditions such as schizophre- 
nia — and his results chime with the idea that stress in childhood or 
adolescence can have a lasting effect on the brain’s development and 
increase susceptibility to psychiatric disease. 

Other scientists are following Meyer-Lindenberg’s lead. Daniel 
Weinberger, director of the Lieber Institute for Brain Development 
in Baltimore, Maryland (and a self-confessed addict “to the cultural 
stimulation of the city”), is planning a huge, long-term project to study 
environmental and genetic risk factors for schizophrenia in China, 
where urbanization is happening at lightning speed. The proportion 
of people living in cities there has doubled in the past two decades, to 
more than half. Together with colleagues at Peking University in Beijing,
Weinberger hopes to study thousands of people who moved to Beijing
from the countryside before they were 12 years old, after they turned 18 
and between the two ages. He will use brain imaging and genetic analy- 
ses to try to understand how urban upbringing and genes alter cogni- 
tion and reasoning, functions that are often disrupted in schizophrenia. 

Researchers suspect that the stress of city living leads to psychiat- 
ric disease mainly in people who are already at risk because of other 
environmental stresses or because they carry risk genes. One candidate 
gene, the details of which have not yet been published, has emerged 
from a large brain-imaging study being conducted by Meyer-Linden- 
berg in Iceland. He chose more than 500 people who had been identi- 
fied by the Reykjavik-based company deCODE Genetics as carrying 
rare mutations that confer a high risk of schizophrenia, and subjected 
them to functional magnetic resonance imaging tests similar to the 
ones he used in his city study. “We've already found that people carry- 
ing that one particular gene variant activate the cingulate cortex when 
they process social stress, just like those who were brought up in cities,”
he says. He expects to find many more gene candidates through the
project, which will run for several more years.

STRESS AND THE CITY
RESEARCHERS ARE TRYING TO IDENTIFY WHICH PARTS
OF CITY LIFE ARE THE MOST STRESSFUL, AND HOW THIS
MIGHT CONTRIBUTE TO MENTAL-HEALTH DISORDERS.

Identifying which parts of a busy city life are the most stressful is 
another massive challenge (see ‘Stress and the city’). The common 
urban experience of feeling different from your neighbours because 
of socioeconomic status or ethnicity could be one factor, Meyer-Lin- 
denberg thinks. If so, immigrant groups, who often experience isola- 
tion, may be processing stress in a similar way to city-dwellers. He is 
now testing this hypothesis in the children of immigrants to Germany. 
(First-generation immigrants are not suitable because the stress of 
social isolation could be confounded by the stress of moving country.) 

Jim van Os, a psychiatrist and epidemiologist at Maastricht Uni- 
versity in the Netherlands, is planning a detailed look at city living to 
identify sources of stress. “It had been slowly dawning on me that as
we know that the brain interacts with the environment, nothing about
mental health will become clear unless we can look at the environment,”
he says. Van Os has developed a smart-phone app that allows partici- 
pants to record their moods, thoughts, location and activities as they 
go about their daily lives. “This is important because mood and affect 
change dynamically in the brain, just as blood pressure does,” he says. 

In a €2-million (US$2.6-million) field study, van Os intends to 
use the app to collect regular information on these factors from 264 
people who have started to show psychological disturbance. He will 
combine these data with brain imaging to test his hypothesis that risk 
of progressing from disturbance to full-blown psychosis is connected 
to a distorted ability to learn which aspects of a new environment are 
rewarding and which are threatening. “For example,” he says, “if you
move to a new district, you’ll need to quickly learn which neighbours
you like enough to develop a relationship with, and how to interpret
social signals that could be harmful.”

Meyer-Lindenberg is planning an even more technologically 
ambitious project with geoscientists at the nearby University of Hei- 
delberg, who have generated a high-resolution map of their city, and 
physicists at the Karlsruhe Institute of Technology in Germany, who 
have developed a mobile device that allows people to be tracked and 
tested for a week as they walk and work around Heidelberg. The device 
can recognize when participants reach a specific location — such as a 
green space or a particularly noisy intersection — and instantly ques- 
tion them about their state of mind or send them a cognitive test to be 
completed on the spot. The scientists will then ask the participants to 
come into the lab for brain-imaging studies that examine how they 
process stress and emotion. By correlating the imaging data with their 
states of mind at different locations, the team hopes to trace how differ- 
ent aspects of city life affect the brain — whether, for example, strolling 
through a park really does have a calming influence on the amygdala 
and cingulate cortex. 

It is a high-risk project that has yet to tempt funders. But Meyer- 
Lindenberg sees the future of the city in it. So too does Annette 
Rudolph-Cleff, an architect and city planner at the Technical Uni- 
versity of Darmstadt in Germany, who contacted Meyer-Lindenberg 
after reading his Nature paper last year and is now advising him on the 
project. “We know far too little about the city at the moment and we 
need these new technologies and approaches to help us make decisions 
about how the city should best be developed,” she says. 

As well as helping in the design of future cities, such work might 
also pinpoint the most stressful parts of an existing metropolis — and 
help to make a case for urban regeneration. Cities are already great 
economic and cultural incubators; Rudolph-Cleff hopes that the new 
science of urban stress could also allow them to be turned into cradles 
of mental health. SEE EDITORIAL P. 143 


Alison Abbott is Nature’s senior European correspondent. 

ROOTS OF RESILIENCE 


Most people bounce 
back from trauma 
— but some never 
recover. Scientists 
are trying to work 
out what underlies 
the difference. 


BY VIRGINIA HUGHES 

On a chilly, January night in 1986, 
Elizabeth Ebaugh carried a bag of 
groceries across the quiet car park of 
a shopping plaza in the suburbs of Washington 
DC. She got into her car and tossed the bag onto 
the empty passenger seat. But as she tried to 
close the door, she found it blocked by a slight, 
unkempt man with a big knife. He forced her to 
slide over and took her place behind the wheel. 

The man drove aimlessly along country 
roads, ranting about his girlfriend’s infidelity 
and the time he had spent in jail. Ebaugh, a 
psychotherapist who was 30 years old at the 
time, used her training to try to calm the man 
and negotiate her freedom. But after several 
hours and a few stops, he took her to a motel, 
watched a pornographic film and raped her. 
Then he forced her back into the car. 

She pleaded with him to let her go, and he 
said that he would. So when he stopped on a 
bridge at around 2 a.m. and told her to get out, 
she thought she was free. Then he motioned 
for her to jump. “That's the time where my sys- 
tem, I think, just lost it,” Ebaugh recalls. Suc- 
cumbing to the terror and exhaustion of the 
night, she fainted. 

Ebaugh awoke in freefall. The man had 
thrown her, limp and handcuffed, off the 
bridge four storeys above a river reservoir. 
When she hit the frigid water, she turned onto 
her back and started kicking. “At that point, 
there was no part of me that thought I wasn’t 
going to make it,” she says. 

Few people will experience psychological 
and physical abuse as terrible as the abuse 
Ebaugh endured that night. But extreme 
stress is not unusual. In the United States, an 
estimated 50-60% of people will experience 
a traumatic event at some point in their lives, 
whether through military combat, assault, 
a serious car accident or a natural disaster. 
Acute stress triggers an intense physiological 
response and cements an association in the 
brain's circuits between the event and fear. If 
this association lingers for more than a month, 
as it does for about 8% of trauma victims, it is 
considered to be post-traumatic stress disorder 
(PTSD). The three main criteria for diagno- 
sis are recurring and frightening memories, 
avoidance of any potential triggers for such 
memories and a heightened state of arousal. 

Ebaugh experienced these symptoms in the 
months after her attack and was diagnosed 
with PTSD. But with the help of friends, psy- 
chologists and spiritual practices, she recov- 
ered. After about five years, she no longer met 
the criteria for the disorder. She opened her 
own private practice, married and had a son. 

About two-thirds of people diagnosed 
with PTSD eventually recover. “The vast 
majority of people actually do OK in the face of 
horrendous stresses and traumas,” says Robert 
Ursano, director of the Center for the Study 
of Traumatic Stress at the Uniformed Services 
University of the Health Sciences in Bethesda, 
Maryland. Ursano and other research- 
ers want to know what underlies people’s 
mental strength. “How does one understand 
the resilience of the human spirit?” he asks. 


Since the 1970s, scientists have learned that 
several psychosocial factors — such as strong 
social networks, recalling and confronting 
fears and an optimistic outlook — help people 
to recover. But today, scientists in the field are 
searching for the biological factors involved. 
Some have found specific genetic variants 
in humans and in animals that influence an 
individual’s odds of developing PTSD. Other 
groups are investigating how the body and 
brain change during the recovery process and 
why psychological interventions do not always 
work. The hope is that this research might lead 
to therapies that enhance resilience. 


A NATURAL RESPONSE 

Although no one can fully understand what 
was going on in Ebaugh’s mind during her 
attack, scientists have some idea of what was 
happening to her body. As soon as Ebaugh saw 
her attacker and his knife, her brain’s pituitary 
gland sent signals to her adrenal glands, atop 
the kidneys, to start pumping out the stress 
hormones adrenaline and cortisol. In turn, 
her pulse quickened, her blood pressure rose 
and beads of sweat formed on her skin. Her 
senses sharpened and her neural circuits 
formed strong memories, so that if she ever 
encountered this threat in the future, she 
would remember the fear and flee. 

The repercussions were profound. For the 
first week after the abduction, “I felt like a new- 
born baby”, Ebaugh says, “like I had to be held, 
or at least be in the presence of somebody”. 
She shivered constantly, was easily startled 
and felt only fear. She could not go near the 
grocery store. 

Nearly every trauma victim experiences 
PTSD symptoms to some degree. Many 
people who are diagnosed with the disorder go 
on to have severe depression, substance-abuse 
problems or suicidal thoughts. PTSD can take 
a horrific toll. Between 2005 and 2009, as a 
growing number of soldiers faced multiple 
deployments in Iraq and Afghanistan, 
suicide rates in the US Army and Marines 
nearly doubled. 

Over the past two decades, researchers have 
used various kinds of imaging techniques to 
peer inside the brains of trauma victims. These 
studies report that in people with PTSD, two 
areas of the brain that are sensitive to stress 
shrink: the hippocampus, a deep region in the 
limbic system important for memory, and the 
anterior cingulate cortex (ACC), a part of the 
prefrontal cortex that is involved in reasoning 
and decision-making. Functional magnetic 
resonance imaging (fMRI), which tracks 
blood flow in the brain, has revealed that 
when people who have PTSD are reminded of 
the trauma, they tend to have an underactive 
prefrontal cortex and an overactive amygdala, 
another limbic brain region, which processes 
fear and emotion (see ‘The signature of stress’). 

People who experience trauma but do not 
develop PTSD, on the other hand, show more 
activity in the prefrontal cortex. In August¹, 
Kerry Ressler, a neuroscientist at Emory Uni- 
versity in Atlanta, Georgia, and his colleagues 
showed that these resilient individuals have 
stronger physical connections between the 
ACC and the hippocampus. This suggests that 
resilience depends partly on communication 
between the reasoning circuitry in the cor- 
tex and the emotional circuitry of the limbic 
system. “It’s as if [resilient people] can have a 
very healthy response to negative stimuli,” says 
Dennis Charney, a psychiatrist at the Mount 
Sinai School of Medicine in New York, who has 
conducted several brain-imaging studies of rape 
victims, soldiers and other trauma survivors. 


ENVIRONMENTAL PROTECTION 

After her abduction, Ebaugh began seeing a 
psychotherapist and several alternative-med- 
icine practitioners. But more than anything 
else, she attributes her resilience to being 
surrounded by caring people — beginning 
within minutes of her escape. 

[Figure: THE SIGNATURE OF STRESS. A traumatic experience elicits an immediate physiological response, and has long-term effects on the brain.] 

After Ebaugh crawled up the rocky 
riverbank, a truck driver picked her up, took 
her to a nearby convenience store and bought 
her a cup of hot tea. Police, when they arrived, 
were sympathetic and patient. The doctor at 
the hospital, she says, treated her like a daugh- 
ter. A close friend took her in for a time. And 
her family offered reassurance and emotional 
support. “For the first month, I almost had to 
tell people to stop coming because I was so sur- 
rounded by friends and community,’ she says. 

Studies of many kinds of trauma have shown 
that social support is a strong buffer against 
PTSD and other psychological problems. 
James Coan, a psychologist at the University 
of Virginia in Charlottesville, has done a series 
of experiments in which women lie in an fMRI 
scanner and see ‘threat cues’ on a screen. They 
are told that between 4 and 10 seconds later, they 
may receive a small electric shock on the ankle. 
The cue triggers sensory arousal and activates 
brain regions associated with fear and anxiety, 
but when the women hold the hands of their 
husbands² or friends³, these responses diminish. 

Social interactions are complex and involve 
many brain circuits and chemicals; no one 
knows exactly why they provide relief. Being 
touched by someone is thought to stimulate 
the release of natural opioids, such as endor- 
phins, in the brain. The ACC is packed with 
opioid receptors, suggesting that touch could 
influence its response to stress. 

Other clues come from the hormone 
oxytocin, which courses through the brain 
during social interaction and has been shown 
to boost trust and reduce anxiety. In one imag- 
ing study⁴, participants viewed frightening 
images after receiving nasal sprays of either 
oxytocin or a placebo. Those who sniffed 
oxytocin showed reduced activation in the 
amygdala and weaker connections between 
the amygdala and the brainstem, which control 
some stress responses, such as heart rate. The 
oxytocin surge that comes from being around 
other people could, like endorphins, help to 
reduce the stress response. 

[Figure labels: During stressful situations, the pituitary gland, in the brain, releases neurohormones. In response, the adrenal glands release adrenaline and cortisol. Adrenaline induces quick, fight-or-flight responses. Cortisol elicits long-lasting stress responses and shuts down the release of neurohormones. Anterior cingulate: involved in rational decision-making; appears smaller in people with post-traumatic stress disorder (PTSD).] 

Past social interactions may also affect how 
a person responds to trauma. Chronic neglect 
and abuse unquestionably lead to a host of 
psychological problems and a greater risk 
of PTSD. Ressler, however, points to a factor 
that is well recognized but poorly understood: 
‘stress inoculation’. Researchers have found 
that rodents⁵ and monkeys⁶, at least, are more 
resilient later in life if they experience isolated 
stress events, such as a shock or a brief separa- 
tion from their mothers, early in infancy. 

Ebaugh says that early stress — and the con- 
fidence she gained in conquering it — helped 
her to recover from her traumatic abduction. 
She was born with a condition that made her 
feet turn inwards. At age ten, she underwent 
surgery to rebuild her knees followed by a year 
of intensive rehabilitation. “It wasn't foreign 
to me to be hurt and have to walk the walk of 
being strong again,” she says. “It’s like a muscle, 
I think, that gets built up.” 


RESILIENT BY NATURE 
Although most people, like Ebaugh, recover 
from trauma, some never do. Some scientists 
are seeking explanations for such differences 
in the epigenome, the chemical modifica- 
tions that help to switch genes on and off (see 
page 171). Others are looking in the genes 
themselves. Take, for example, FKBP5, a 
gene involved in hormonal feedback loops 
in the brain that drive the stress response. In 
2008, Ressler and his colleagues showed that 
in low-income, inner-city residents who had 
been physically or sexually abused as children, 
certain variants in FKBP5 predisposed them 
to developing PTSD symptoms in adulthood. 
Other variants offered protection⁷. 

The most talked-about biological marker of 
resilience is neuropeptide Y (NPY), a hormone 
released in the brain during stress. Unlike the 
stress hormones that put the body on high alert 
in response to trauma, NPY acts at receptors in 
several parts of the brain — including the amyg- 
dala, prefrontal cortex, hippocampus and brain- 
stem — to help shut off the alarm. “In resiliency, 
these brake systems are turning out to be the 
most relevant,” says Renu Sah, a neuroscientist 
at the University of Cincinnati in Ohio. 

[Figure labels: Hippocampus: has roles in long-term memory formation; appears smaller in people with PTSD. Processes memory and emotional reactions and is overly responsive] 

Interest in NPY and resilience took off in 
2000, partly because of a study of healthy US 
Army soldiers who participated in a survival 
course designed to simulate the conditions 
endured by prisoners of war, such as food and 
sleep deprivation, isolation and intense inter- 
rogations⁸. NPY levels went up in the soldiers’ 
blood within hours of the interrogations. 
Special Forces soldiers who had trained to be 
resilient had significantly higher NPY levels 
than typical soldiers. 

Researchers are now conducting animal 
experiments to study how NPY works. In one 
experiment, a team at the Indiana University 
School of Medicine in Indianapolis restrained 
a rat in a tight-fitting plastic pouch for 30 min- 
utes, then released it into a box with another 
rat⁹. The restraint made the rat so anxious that 
it avoided interacting with the other animal for 
90 minutes. But when rats were injected with 
NPY before the treatment, they interacted with 
cage-mates as if nothing had happened. 

The work could lead to treatments. Charney’s 
group at Mount Sinai is carrying out a phase II 
clinical trial of an NPY nasal spray for indi- 
viduals with PTSD. Others are investigating 
small molecules that can cross the blood-brain 
barrier and block certain receptors that control 
NPY release. 


CONFLICT RESOLUTION 

The US military is leading the hunt for 
additional biological markers of resilience. 
Since 2008 — driven in part by soaring suicide 
rates among soldiers — the US Army has col- 
laborated with the National Institute of Mental 
Health and several academic institutions on a 
US$65-million project called Army STARRS 
(the Study to Assess Risk and Resilience in 
Servicemembers). The project has many parts, 
including a retrospective look at de-identified 
medical and administrative records for 1.6 mil- 
lion soldiers, in search of early warnings of 
suicide, PTSD and other mental-health prob- 
lems. STARRS scientists are also collecting data 
— such as blood samples, medical histories and 
cognitive testing results — on tens of thousands 
of current soldiers. The researchers expect to 
publish their first findings early next year. 

The military also funds research into animal 
models of resilience. Most rodents will quickly 
learn to associate painful foot shocks with a cer- 
tain cue, such as a tone or a specific cage. After 
they have learned the association, the rodents 
freeze on experiencing the cue, even without 
the shock. Several years ago, Abraham Palmer, 
a geneticist now at the University of Chicago 
in Illinois, made a line of resilient mice by 
selectively breeding mice that froze for abnor- 
mally short periods of time. After about four 
generations, he had mice that froze for about 
half the time of typical animals¹⁰. The effect 
was not due to a difference in pain sensitivity or 
general learning ability. This month, Luke 
Johnson, a neuroscientist at the Uniformed 
Services University, will present data at 
the Society for Neuroscience meeting in 
New Orleans, Louisiana, showing that these 
mice have uncommonly low activity in the 
amygdala and hippocampus, consistent with 
human studies of PTSD resilience. They also 
have low levels of corticosterone, a stress 
hormone, in their urine. 

[Photo caption: Elizabeth Ebaugh is finally comfortable visiting the bridge from which she was thrown 26 years ago.] 

“They have a quieter system, even at rest,” 
says Johnson. “It suggests that there are under- 
lying biological traits that are associated with 
the capacity of the animal for fear memory.” In 
future experiments, Johnson plans to use the 
mice to study NPY and potential new therapies. 

Ebaugh, who now specializes in therapy 
for trauma victims, agrees that drug-based 
treatments could aid in recovery. But some 
people may find relief elsewhere. Religious 
practices — especially those that emphasize 
altruism, community and having a purpose in 
life — have been found to help trauma victims 
to overcome PTSD. Ebaugh says that yoga, 
meditation, natural remedies and acupuncture 
worked for her. 

Today, she buys groceries at the plaza where 
she was abducted, and she drives over the bridge 
she was thrown from as though it were any other 
road. She says that she has forgiven the man who 
abducted her. When she reflects on what he did, 
it’s not with anger, sadness or fear. “It doesn't feel 
like it affects my life at all at this point, at least 
not in a negative way,” she says. “In a positive 
way, it’s been a huge teacher.” 


Virginia Hughes is a freelance science writer 
in New York City. 


1. Fani, N. et al. Neuropsychopharmacology http://dx.doi.org/10.1038/npp.2012.146 (2012). 
2. Coan, J. A., Schaefer, H. S. & Davidson, R. J. Psychol. Sci. 17, 1032-1039 (2006). 
3. Coan, J. A., Beckes, L. & Allen, J. P. Int. J. Psychophysiol. (in the press). 
4. Kirsch, P. et al. J. Neurosci. 25, 11489-11493 (2005). 
5. Maier, S. F. et al. Dialogues Clin. Neurosci. 8, 397-406 (2006). 
6. Lyons, D. M. & Parker, K. J. J. Traum. Stress 20, 423-433 (2007). 
7. Binder, E. B. et al. J. Am. Med. Assoc. 299, 1291-1305 (2008). 
8. Morgan, C. A. III et al. Biol. Psychiatry 47, 902-909 (2000). 
9. Sajdyk, T. J. et al. J. Neurosci. 28, 893-903 (2008). 
10. Ponder, C. A. et al. Genes Brain Behav. 6, 736-749 (2007). 




11 OCTOBER 2012 | VOL 490 | NATURE | 167 


© 2012 Macmillan Publishers Limited. All rights reserved 


ILLUSTRATION BY PADDY MILLS 


COMMENT 



Too toxic to ignore 


A stark warning about the societal costs of stress comes from links between shortened 
telomeres, chronic stress and disease, say Elizabeth H. Blackburn and Elissa S. Epel. 


In the 2006 film The Holiday, the actress 
Cameron Diaz, playing a woman whose 
life is spinning out of control, exclaims: 
“Severe stress ... causes the DNA in our cells 
to shrink until they can no longer replicate. 
So when we’re stressed we look haggard.” 
Hollywood got that science right. The 
DNA to which Diaz’s character alludes 
is the segment that makes up telomeres, 
structures that cap and protect the ends of 
chromosomes. She was referring to our 2004 
publication — the first to link chronic psy- 
chological stress to compromised telomere 
maintenance. 
Since that paper, researchers have consist- 
ently found that various types of chronic stress 
are linked to — and probably cause — shorter 
telomeres. Meanwhile, both telomere short- 
ness and stress have independently been asso- 
ciated with several common conditions, such 
as cardiovascular disease and diabetes. 
These associations are so widespread 
and consistent that even without a detailed 
understanding of the biochemical pathways 
involved, the message is clear. Failure to 
alleviate severe stress caused by prolonged 
threats such as war, financial hardship, 
abuse and emotional neglect, particularly in 
children, will result in exponentially higher 
costs further down the line — personal, 
economic and otherwise. 

STRESS AND RESILIENCE: The links between adversity and mental illness. nature.com/stress 


SHORT ODDS 

Human telomeres are several kilobases of 
repeated sequences of DNA bound by spe- 
cialized protective proteins. A peculiarity of 
the DNA-replication mechanism causes tel- 
omeres to shorten as cells divide. Sometimes 
the enzyme telomerase can replenish the lost 
DNA, but as we age, our telomeres dwindle. If 
they get too short, through ageing or because 
telomere maintenance goes awry, cells 
can stop dividing. Such cells also become 
malfunctional. For instance, they can start 
secreting factors that cause inflammation or 
trigger the development of tumours. 

In 2004, we compared telomere lengths 
in the white blood cells of mothers of chroni- 
cally ill children to those in mothers of healthy 
children. The longer a woman had spent 
being the main carer of her ill child (the chil- 
dren’s conditions ranged from gut disorders 
to autism), the shorter were her telomeres. 
Moreover, in both groups, the more severe 
her psychological stress — as judged by her 
answers to standardized questions about, for 
instance, how in control she felt over her life 
— the shorter were her telomeres. The extra 
telomere shortening in the ‘most stressed’ 
mothers (compared with that in the ‘least 
stressed’ mothers) was equivalent to that 
caused by at least a decade of ageing. 

This relationship between stress and 
telomere length keeps showing up: from 
studies of kindergarten-aged children 
to adults as old as 80; from small clinical 
samples of fewer than 100 people to large 
population-based samples of thousands. 

Under stress, the body ramps up its pro- 
duction of certain hormones, such as cor- 
tisol, and other biochemical factors. These 
compounds help to mediate an appropriate 
response to short-term stress. But when 
overproduced for months or years, they can 
alter gene expression, probably with deleteri- 
ous effects (see page 171). In the laboratory, 
the same factors can shorten telomeres — in 
the case of cortisol, by reducing the activity 
of telomerase. It is likely that the pathways 
that mediate alterations to gene expression 
interact with those affecting telomere main- 
tenance, although this has yet to be explored. 

Although many studies have unearthed 
(and continue to uncover) associations 
between stress and eroded telomeres, others 
have forged links between telomere shortness 
and common disorders. Rare mutations of 
genes encoding components of telomerase 
cause telomeres to be too short. This results 
in immune-system disorders such as aplastic 
anaemia, and other conditions including pul- 
monary fibrosis, diabetes, some cardiovascu- 
lar diseases and certain cancers. Remarkably, 
many of these inherited diseases, recently 
named ‘telomere syndromes’, are those that 
are commonly associated with ageing. 

Telomere shortness can even predict peo- 
ple’s statistical risk of developing certain 
conditions. For instance, for 10 years after 
their telomeres had been measured, men and 
women with shorter telomeres were three 
times more likely than those with longer 
ones to develop certain types of cancers such 
as pancreatic cancer. If elderly, people with 
shorter telomeres were also 50% more likely 
than those with longer telomeres to develop 
dementia and 50% more likely to die from 
any particular cause. 

Evidence that lifestyle, well-being and 
other environmental factors can contribute 
significantly to disease has been accumulat- 
ing for decades. But we now have three pair- 
wise links involving three factors: stress with 
telomere shortness; stress with disease risks; 
and telomere shortness with risks for these 
diseases. It is hard to avoid the inference that 
at least one of the ways stress causes chronic 
diseases is by shortening telomeres. 


TIME TO ACT 

How can policy-makers act now on the 
wealth of evidence that is accumulating from 
studies of telomeres? 

Dissecting the cellular and physiological 
pathways linking unusually short telomeres 
to stress and disease could prove important 
for finding possible targets for treatments. 
But policy-makers don't need to wait for all 
the mechanistic details to be filled in (which 
could take decades) — especially given that 
what happens in the laboratory often fails to 
reflect what is going on in the body as a whole. 

A striking message from the telomere story 
is the importance of considering environ- 
mental as well as genetic effects in addressing 
disease. From 2008 to 2011, the US National 
Institutes of Health (NIH) allocated a total of 
US$29 million to genetic research. By com- 
parison, over the same period, it gave only 
$14 million to all behavioural- and social- 
sciences research. This is despite several 
analyses attributing roughly 50% of the vari- 
ance in early mortality in the United States to 
largely modifiable behaviours such as over- 
eating, alcohol abuse and smoking (which are 
also stress-related in part). 

Using current technologies, people's gene 
variations are easier to track down than the 
plethora of influences coming from outside 
the body, let alone how people behave, think 
and feel. Yet capturing a person's vulnerability 
to disease will require understanding all these 
inputs. Telomere length provides an espe- 
cially good window into global physiological 
status and is a bellwether of disease. (Interest- 
ingly, it is often more affected by a person's 
life experience than by which variants of 
telomere-maintenance genes they carry.) 

[Figure: TELOMERES TELL. They are shorter in adults who experienced more trauma as children.] 

Telomere research also points to some 
practical ways to improve health. Mice that 
are genetically deprived of telomerase quickly 
become wizened and grey but such changes, 
normally associated with ageing, can be at 
least partly reversed by restoring telomerase 
activity. Designing drugs to boost telomer- 
ase in humans without inducing unwanted 
side effects is a formidable challenge. More 
feasible approaches to alleviating telomere 
shortening could involve mitigating the con- 
ditions that lead to chronic stress and helping 
people to change certain behaviours. 
Encouragingly for this latter approach, 
some pilot studies suggest that just three 
months of stress-reduction interventions, 
often along with increased physical activ- 
ity and dietary changes, may slow or even 
reverse telomere attrition by increasing tel- 
omerase activity. An intriguing challenge is 
to see whether the idea of telomeres erod- 
ing (which many have told us conjures up 
a striking image of declining health) could 
motivate people to change their behaviour. 


START YOUNG 

Individual efforts apart, perhaps the strong- 
est message to come from the work on tel- 
omeres is that to pre-empt many common 
diseases, especially those that are becoming 
increasingly prevalent in the world’s ageing 
population, governments and other policy- 
makers need to prioritize what we call ‘soci- 
etal stress reduction’. Meditation retreats or 
yoga classes may help those who can afford 
the time and expense. But we are talking 
about broad socioeconomic policies to buffer 
the chronic stressors faced by so many. 

Together, the results from studies of 
telomeres, stress and disease reinforce a 2008 
recommendation from the World Health 
Organization (WHO) for reducing health 
inequities. The WHO proposes that govern- 
ments and global health organizations focus 
on improving education and the conditions 
surrounding early child development. 

Several studies indicate that stress begins 
eroding telomeres in infancy, and maybe 
even before children are born. For example, 
studies have found that the more violence 
children had experienced, or the longer they 
had spent in an orphanage, the shorter were 
their telomeres. Even young healthy adults 
whose mothers had experienced severe 
stress while pregnant (for instance, because 
of a close family member dying) had shorter 
telomeres than people whose mothers had 
relatively stress-free pregnancies. 

What is more, the effects of stress early in 
life reverberate into adulthood. In more than 
4,000 middle-aged UK women, for example, 
the more categories of adversity each woman 
had experienced as a child (such as physical 
abuse or parental divorce, unemployment or 
drug use), the shorter were her telomeres. 
This has also been found in the elderly 
(see ‘Telomeres tell’). Even experiencing 
fewer years of school education in early 
life is associated with having shorter tel- 
omeres in middle and old age. 

SOURCE: J. K. KIECOLT-GLASER ET AL. PSYCHOSOM. MED. 73, 16-22 (2011). 

Delaying actions that mitigate diseases 
such as diabetes until adulthood will only 
exacerbate personal and societal costs. 
One example of a proactive step is the 
US health programme Medicaid’s Strong 
Start initiative, which aims to enhance 
pregnancy-related care. Improving the 
education and health of women of child- 
bearing age in general could be a highly 
effective way to prevent poor health 
filtering down through generations. 

To suggest that people’s quality of life 
matters or that societies and governments 
should be allocating more resources to 
mothers and children is hardly new or 
controversial. What is new is the wealth 
of evidence demonstrating that telomeres 
powerfully quantify life’s insults. They are 
shorter in people who were exposed to 
adversity as children, and shorter still for 
each year a person spends depressed, car- 
ing for a sick child, being abused and so on. 

Telomeres send one more signal — 
from the tips of our chromosomes — that 
unmanageable social and psychologi- 
cal stress, especially during early life, 
is as insidious as smoking or too much 
fast food. SEE COMMENT P.171 

Stress makes its 
molecular mark 


Trauma affects people differently. Epigenetics may be 
partly to blame, says Eric J. Nestler. 


Some people exposed to severe stress, 
such as that caused by prolonged eco- 
nomic hardship or sexual or physical 
abuse, go on to develop devastating psycho- 
logical or other health problems. Others are 
more resilient. If one identical twin shows 
symptoms of stress-related depression, for 
instance, the other will also be depressed in 
only around 40% of cases. I believe that epi- 
genetic mechanisms help to explain why. 
These are experience-dependent molecular 
alterations to DNA or to proteins that alter 
how genes behave without changing the 
information they contain. 

Recent studies suggest that epigenetic 
mechanisms shape short-term (lasting hours) 
and long-term (lasting months, years or even 
a lifetime) responses to stress. Some studies 
even hint that epigenetic changes could affect 
the next generation. A serious effort to both
map and substantiate associations between
behavioural responses and epigenetic alterations,
although costly and challenging,
would almost certainly flag up possibilities
for treatments that either reverse the effects
of stress or enhance a person's ability to cope.

STRESS AND RESILIENCE
The links between adversity and
mental illness. nature.com/stress


AGGRESSIVE MICE 

When a person is stressed, gene expression 
in parts of the brain may be up- or down- 
regulated. This can occur through chemical 
modifications to DNA, to regulatory pro- 
teins in the nuclei of brain cells or to histones 
(proteins that package and order DNA). 
Many stress-induced changes are adaptive, 
but some seem to be damaging. 

In my laboratory, we have stressed mice by 
repeatedly exposing them to more aggressive
mice (see ‘A switch for resilience’). After
ten days of this treatment, the stressed mice 
begin to avoid other mice, show less interest 
in things that normally excite them (such as 
sweets and sex), become less adventurous 
and even grow obese (they take less pleasure 
in eating but eat more).
Many of these symptoms can persist for
months and are treatable with standard antidepressant
medications. We have also found
that mice given cocaine the week before being
exposed to an aggressive mouse have more
extensive epigenetic modifications, which
induce more stress-related symptoms.

ILLUSTRATION BY PADDY MILLS

Of the hundreds of mice studied in my lab, 
roughly one-third become less adventurous 
when stressed but have no other symptoms. 
By looking at differences in gene expression 
and structural organization of DNA between 
these ‘resilient’ mice and more susceptible 
mice, we have linked distinct behavioural 
responses to specific molecular alterations 
— all in regions of the brain important in 
reward recognition. These alterations 
include differences in DNA methylation, 
patterns of attachment of acetyl or methyl 
groups to histones and activity of various 
transcription factors. They last for days or, 
in some cases, for several weeks. 

We can make susceptible mice resilient 
by blocking or inducing epigenetic mod- 
ifications to certain genes or by altering the 
expression patterns of those genes to mimic 
the epigenetic tweaks. Likewise, epigenetic 
modifications and gene expression can be 
altered in resilient mice to make them more 
susceptible. 

Other groups have found similar epi- 
genetic alterations that last a lifetime. For 
instance, rat pups that are rarely licked and 
groomed by their mothers are more susceptible
to stress later in life than are pups with
more diligent carers. They are less adven- 
turous than better-cared-for offspring and 
put up less of a fight in unpleasant situations 
(such as being placed in a beaker of water). 
Moreover, the females are less nurturing 
towards their own offspring. Epigenetic 
modifications seem to occur at several 
genes in the hippocampus in response to 
how much grooming young rats receive, 
and these alterations persist into adulthood.

These findings are likely to hold up in 
humans. For example, researchers have found 
that the genes identified in the rat-grooming 
studies were more methylated in the hippocampi
of suicide victims who had experienced
trauma as children than in those of
people who had died from suicide or natural 
causes and whose childhoods were normal. 
Likewise, our findings in mice given cocaine 
mirror epidemiological studies from the past 
few decades that have linked drug abuse, obe- 
sity and conditions such as multiple sclerosis, 
diabetes and heart disease to increased sus- 
ceptibility to stress in humans. 

More controversial is whether animals 
inherit epigenetic vulnerability to stress. 
According to this notion, epigenetic modi- 
fications in sperm or eggs drive aberrant 
patterns of gene expression in the next generation.
Several groups have reported that
male mice exposed to stress (by being
removed from their mothers as pups or
exposed to more aggressive mice as adults,
for example) produce offspring that are
more vulnerable to stress.

A SWITCH FOR RESILIENCE
Mice become shy of other mice after repeated
exposure to aggressive peers. Mimicking certain
epigenetic tweaks makes them bolder.

A mechanism is still elusive. Exposure 
to stress could somehow corrupt the male 
mouse’s behaviour or affect some signalling 
molecule in his semen such that his partner 
alters her care for their young. Another 
possibility is that stress-linked epigenetic 
‘marks’ in the sperm affect the development
of offspring. No causal evidence yet
links epigenetic changes in sperm to altered 
behaviour in offspring. 


MAPPING MARKS 

Epigenetics is in vogue: over the past five 
years, researchers have proposed epigenetic 
explanations for all sorts of phenomena, from 
language acquisition to obesity, without clear 
proof. At a meeting I attended two years ago, 
people proposed that the spread of Christian- 
ity in the early centuries AD was partly due 
to epigenetic mechanisms. Furthermore, 
researchers too often identify correlations 
between behaviour and molecular alterations 
in cells without also establishing a causal 
link. Some biologists are, rightfully, wary. 

Yet the results I have described show how 
important epigenetic mechanisms are likely 
to be in understanding the effects of stress 
and in discovering ways to manage it. 

It is time for researchers to begin the 
difficult business of substantiating associa- 
tions. Cost is one challenge to pinpointing 
the genes and biochemical pathways 
involved in epigenetically mediated 
responses to stress. Hundreds of known 
types of modifications probably act in 
complex combinations. Mapping each mark 
at specific points in development in a rat or 
mouse, and doing so in each brain region 
or peripheral tissue, would cost tens of 
thousands of dollars. Defining the alterations 
in the many cell types in a given brain region
pushes the cost several-fold higher. And the 
expense for doing so in humans is orders
of magnitude higher still. Human genetic
diversity means that researchers will prob- 
ably need to study hundreds or thousands 
of people to obtain a meaningful picture. A 
related challenge is that of obtaining enough 
computational power to analyse the hundreds 
of terabytes of sequencing data that would be 
produced — although recent advances in bio- 
informatics are beginning to help. 

At present, researchers study epige- 
netic alterations by up- or downregulating 
enzymes such as histone methyltransferases. 
But such enzymes can influence thousands 
of genes. Tools that allow researchers to 
target a specific type of epigenetic modifica- 
tion to a single gene in a particular cell type 
in vivo will usher in a far more compelling 
phase of research. 

Work in simpler organisms, such as the 
roundworm Caenorhabditis elegans, is pro- 
viding clues about the range of epigenetic 
modifications that can occur in sperm or egg 
cells. Of course, experiments in mammals 
will be needed to establish whether epigenetic 
transmission of information can occur across 
generations to a meaningful extent. 

Thirty per cent of lost productivity world- 
wide is caused by psychiatric conditions 
such as depression, anxiety and schizo- 
phrenia — all of which are exacerbated by 
chronic stress. In the developed world, that 
proportion is 40%. The toxic levels of stress 
people are now exposed to — in part thanks 
to the increases in productivity, lifespan and 
competitiveness that come with a wealthier, 
healthier globalized economy — are here to 
stay. A serious endeavour to understand why 
people respond to stressful experiences so 
differently — challenging as it would be — is 
easily justified. SEE COMMENT P.169

Some people need more rest than the prescribed eight hours a night, or need it at different times.


Broken dreams 


Meredith Wadman lifts the blanket on the creeping 
medicalization of sleep in the United States. 


Millions of people in the United
States struggle to achieve that
great American dream, a ‘good
night’s sleep’. So says Matthew Wolf-Meyer
in his ambitious The Slumbering Masses. He 
contends that capitalist necessity defines 
sleep in the country today, shoehorning 
sleepers into a societally convenient but 
physiologically arbitrary eight-hour night. 

Those who can't manage the prescribed 
amount of slumber at the prescribed time 
are often labelled disordered sleepers. Wolf- 
Meyer's message is that society should bend 
to accommodate, even celebrate, diversity 
in sleeping behaviour, rather than branding 
nonconformism pathological. 

For now, night owls and others who fail 
to adapt to the eight-hour, nocturnal norm 
— whether owing to disease or a particular 
hard-wired biology — are a boon to the phar- 
maceutical industry. Wolf-Meyer, an anthro- 
pologist, calls for a shift towards more flexible 
organization of workdays, school and social 
lives, and away from the assumption of mon- 
olithic “slumbering masses”. Otherwise, he 
warns, “Americans may be doomed to a future
of proliferating sleep disorders, amphetamine
breakfasts, and sedatives for dinner”.


Wolf-Meyer lays much responsibility
for the medicalization of sleep at the feet of
a US sleep-medicine establishment that has
grown up since the 1950s. Its roots, however,
emerge in his fascinating history of
the Protestant origins of sleep in the United
States. The influential Puritan minister Cotton
Mather argued in the late seventeenth
century that those with a proclivity for
the luxuries of slumber were failing in their 
earthly and God-given duty to be productive. 
The bed, Mather opined, is one of just a few 
places where “the Devil has laid out most fatal 
snares”. A generation later, Benjamin Frank- 
lin turned the same message positive with his 
still-famous dictum, “Early to bed, early to 
rise, makes a man healthy, wealthy and wise.”

This morality handily converged with 
the twentieth-century idea of ‘normal
sleep’: a consolidated nocturnal experience
programmed by biology, but potentially 
moulded by behaviour modification and, 
of course, drugs. The work of sleep-science 
pioneer Nathaniel Kleitman, who in 1938 
descended into Mammoth Cave in Kentucky 
to try to realign his biology to a 28-hour day, 
thrust an awareness of sleep research into 
the public eye. It also reinforced the idea 
that sleep can be controlled by willpower — 
a concept further cemented by Kleitman’s 
student, William Dement. 

Dement founded the first sleep laboratory, 
at Stanford University in California, in the 
early 1970s. He understood healthy sleep “as 
resting upon a biological foundation of eight 
quiet, motionless and consolidated hours of 
sleep ... between sunset and sunrise”, writes 
Wolf-Meyer, arguing that Dement’s The 
Promise of Sleep (co-written by Christopher 
Vaughan; Delacorte, 1999) “promotes a 
model of nature and human biology from 
which all variations are disorders”. Work 
such as Dement’s has, in Wolf-Meyer’s view, 
been one factor in helping to pave the way 
for the modern pharmaceutical industry to 
find a broad new market among disordered 
US sleepers. Many of these sleepers are given 
the catch-all diagnosis of ‘excessive daytime 
sleepiness’, the use of which in the medical 
literature has exploded in the past ten years. 

Those looking for an exhaustive pharma- 
copeia of the sleep-medicine industry will 
not find it here. Wolf-Meyer notes that US
drug-maker Sepracor spent “hundreds
of millions of dollars” on the US launch of 
the sedative Lunesta (eszopiclone). But 
he offers only one example to support his 
contention that companies are widening 
their net: pharmaceutical firm Cephalon’s 
successful effort to expand the user base 
for the stimulant Provigil (modafinil) from 
people with narcolepsy to those with sleep 
apnoea and ‘shift-work sleep disorder’. 
Neither has Wolf-Meyer tracked down the 
associated and — he implies — growing 
revenue numbers for leading stimulants 
and sedatives. Such figures would buttress 
his claim that in “a few short years, exces- 
sive sleepiness may successfully become 
the new erectile dysfunction”.
Wolf-Meyer might have delved deeper
if his scope had been narrower. As it is, he
covers everything from the socialization
of children to sleep at the appropriate hour
(think of Margaret Wise Brown's bedtime
classic Goodnight, Moon) to the use and
abuse of caffeine and other stimulants,
and the plight of workers in Indian call
centres, forced to synchronize their hours
[…] treatment pays dividends. It is illuminating,
for instance, when Wolf-Meyer takes
us inside a Minnesota sleep clinic where
medics are struggling to help a sleepless
young girl already prescribed five drugs.

Implicit in Wolf-Meyer’s analysis is that
the medicalization of sleep is a profit-driven
pursuit. No doubt economics did
indeed have a big role in the rise of sleep
medicine, but the author finds little room
for the real benefits of some treatments.
Can it be bad when medication prevents
a narcoleptic from falling asleep at the
wheel? And I, as a sleep apnoeic with
daily work and family responsibilities, am
indebted to the machine that keeps my
airway open at night and prevents what
would otherwise be literally hundreds of
sub-conscious wakenings.

The Slumbering Masses suffers in
many places from jargon. Sentences
such as, “Normative desire facilitates the
functioning of everyday spatiotemporal
hegemony and is in turn formed through
that very same hegemony” made me long
for a lay-friendly translation.

But there are passages of telling clarity.
Wolf-Meyer tells the story of Betsy,
a woman in her 50s who fought insomnia
for decades. She tried, she confesses,
“lots and lots of drugs. Everything from
benzo[diazepines] to Xanax, antidepressants,
and all the tricyclics [as well as]
muscle relaxers mixed with other drugs.
And they're effective for a while, and then
they all wear off.”

Meredith Wadman is a correspondent
for Nature based in Washington DC.


PSYCHIATRY

The dispossessed

Amy Maxmen views a prizewinning film that shines
a light into the dark corners of US psychiatric care.

Kings Park: Stories From an American Mental Institution
Wildlight Productions: 2012. kingsparkmovie.com

In the early twentieth century, unpaid patient labour kept Kings Park hospital self-sufficient.

Lucy Winer checked into Kings Park
hospital on Long Island,
New York, after overdosing on sleeping
pills and slitting her wrists. It was 1967; she
was 17. Stern nurses dressed her in a hospital
gown and escorted her into a room where
identically dressed women slept on the floor
or leaned lifelessly against walls. The women
were literally floored by antipsychotic medications
that, as Winer was to find in her six
months in the hospital, felt like an iron suit.

“We had been thrown away, stripped,
locked up. We were disposable,” says Winer,
in the documentary Kings Park. Winer
directed and co-produced the film 30 years
after her stay at the hospital, now long abandoned.
Kings Park tells a tale of mental health
care that must be told, she says. The psychiatrists
who are now showing it at meetings and
workshops around the United States agree:
last month, the New York Association of
Psychiatric Rehabilitation Services presented
Winer with the 2012 Public Education/Media
award. Kings Park touches a nerve.

The psychiatrists’ goal is to inspire progress
by conveying to mental health-care
providers how it felt to be on the receiving
end of deficiencies in state mental
institutions. Although the drugs administered
to people with serious mental illness
are arguably less dangerous now than
they were in the 1960s and therapy is widely
accepted, not everyone who needs these
advances receives them. In the United States,
more than 10% of people with serious mental
illness are now homeless, or in prison
(L. Davis et al. Curr. Psychiatry Rep. 14, 259-269; 2012).

Winer’s intent, too, is to shine a light on
the dehumanized patients, and overwhelmed
doctors, challenged by inadequate
resources for mental-health treatment.
She spent 11 years creating the film to
explore her terrifying personal experience
with mental illness as a teenager. As she
turns her gaze outwards in interviews with
psychiatrists, attendants and other former
patients, Kings Park also becomes a history
of US psychiatry told from multiple perspectives.
The journey ends in the present,
at the assisted-living centres and prisons
where many former residents of psychiatric
hospitals live. Most large state-run mental
hospitals in the United States shut their
doors over the course of four decades:
between 1955 and 2003, the number of inpatients
in them dropped by more than 90%.

Kings Park was comparable to other US
state hospitals, but larger and more venerable 
than many. It opened in 1885 as an asylum 
for people with mental illness in New York 
City. In the 1950s, it housed roughly 9,000 
patients at any one time. In the same dec- 
ade, Thorazine (chlorpromazine), an anti- 
psychotic drug prescribed for schizophrenia 
and other psychiatric disorders, came to 
Kings Park. It replaced cruder means of qui- 
eting patients, such as lobotomy and electric- 
shock therapy, but caused slurred speech, the 
writhing and shaking of tardive dyskinesia 
and other distressing side effects, as doctors 
experimented with various formulations. 

Understaffing occasionally led to patients 
being harmed when attendants rushed to 
quiet them. Hannelore Lehnhoff, a psy- 
chiatrist at Kings Park from 1960 to 1985, 
is noticeably distraught as she recalls an 
extreme example in which a patient seemed 
to have been suffocated with a pillow by 
nurses. Winer captures the frustration 
that psychiatrists felt with too little time to 
address the needs of so many patients, and 
a lack of tools to tend to those they saw. The 
sombre tone lifts momentarily with the news 
of the Community Mental Health Act signed 
by President John Kennedy in 1963, which 
outlined how small, assisted-living situations 
could provide an alternative to massive state- 
run institutions. Funding to Kings Park fell 
soon after the act was passed, and the hospi- 
tal began to downsize. It shut in 1996. 

At the assisted-living facilities visited in 
Kings Park, former patients cook, give tours to 
prospective residents and their families, and 
discuss in peer-support groups how much 
better it is to be needed than to be abandoned. 

But such facilities were and remain rare. 
Kings Park shows how federal and state budg- 
ets were scaled back as state hospitals closed, 
and many patients ended up on the streets. 
A trembling former Kings Park resident with 
schizophrenia, who is now in prison, speaks 
to camera about how he was arrested for 
sleeping outside a church. In scenes at Suffolk 
County Correctional Facility in New York, we 
hear that one-fifth of the inmates have been 
diagnosed as mentally ill. Once more, their 
clothes have been swapped for uniforms. 

After a screening at the American Psychi- 
atric Association’s annual meeting in May, the 
discussion lasted for more than an hour. For 
Evelyn Bromet, a professor of psychiatry at 
Stony Brook School of Medicine in New York, 
Kings Park offers a provocative way to teach 
history to avoid repeating it. “There are mental-health
researchers who have no appreciation
for what state hospitals were like,” says
Bromet. “Winer is telling the story of an enormous
group of people who are forgotten.”


Amy Maxmen is a freelance journalist in 
Brooklyn, New York. 
e-mail: amaxmen@gmail.com 


Books in brief 


Bad Pharma: How Drug Companies Mislead Doctors and Harm 
Patients 

Ben Goldacre FOURTH ESTATE 448 pp. £13.99 (2012) 

Psychiatrist and firebrand Ben Goldacre puts big pharma in the dock. 
Skewering an industry riddled with ‘side effects’ — from suppressed 
trial results to diseases invented for profit — and backed by poor 
regulation and aggressive marketing, Goldacre also offers pragmatic 
solutions. Further exposure of a seam mined by, among others, 
Marcia Angell, former editor-in-chief of The New England Journal
of Medicine (The Truth about the Drug Companies; Random House,
2004). Hear an interview with Goldacre at go.nature.com/huccmd. 




On The Map: Why the World Looks the Way it Does 

Simon Garfield PROFILE BOOKS 468 pp. £16.99 (2012) 

The sixteenth-century Mercator world projection has aged well — 
Google Maps uses it. Cartography, asserts Simon Garfield, evolves 
endlessly but is rooted in enduring needs: to discover frontiers, plot 
progress, keep our bearings. The myriad maps he shows are doorways 
into key moments in cartographic history, from Venetian monk Fra 
Mauro’s 1459 world map, the last hurrah of the medieval planisphere, 
to Gordon Home's ruler-like rendering of Roald Amundsen’s 1911 
route to the South Pole. Disease mapping, brain mapping, gaming — 
charting the world has morphed into mapping worlds within worlds. 


Double Entry: How the Merchants of Venice Created Modern 
Finance 

Jane Gleeson-White ALLEN & UNWIN 304 pp. £12.99 (2012) 

Intrigued by the economic engine driving Renaissance art, Jane 
Gleeson-White crafted this gem of a history. It hinges on monk Luca 
Pacioli, who published the first treatise on double-entry bookkeeping 
—a mercantile system underpinning today’s global economy. There 
are intriguing circularities: Pacioli probably learned mathematics 
from artist Piero della Francesca, then helped Leonardo da Vinci 
with linear perspective. Gleeson-White traces the system’s impact 
through Keynesian successes and today’s high-finance excesses. 


Apocalyptic Planet: Field Guide to the Everending Earth 

Craig Childs PANTHEON BOOKS 368 pp. £17.46 (2012) 

Mass extinctions, eras of “equatorial seas bobbing with slush”, 
wall-to-wall desert — Earth’s cycles of destruction, science writer 
Craig Childs reminds us, are dramatic, relentless and constant. 
Which end of the world will be next? Childs travelled to nine 
apocalyptic places for a taste of possible future cataclysms. Each 
snapshot is research-rich: we get the swirling of warm air cells 
towards the poles on a trek in North America’s Sonoran Desert, and 
the “genetic exhaustion” of Iowa on a gruelling hike through vast 
cornfields. Science shot through with real lyricism. 


’Pataphysics: A Useless Guide 
Andrew Hugill MIT PRESS 296 pp. £17.95 (2012) 
For a science that doesn’t exist, ’pataphysics has popped up
with happy regularity for decades, ever since playwright and
exquisite jokester Alfred Jarry conceived of the proto-Dadaist
‘science of imaginary solutions’. Andrew Hugill, a professor of digital
humanities, has spent 25 years charting ’pataphysics in science
and the arts. He teases out its influence on everything from the
philosophy of Jean Baudrillard to the novels of Philip K. Dick, and
suggests that Jarry may have pre-empted theories of antimatter.


Correspondence


Drop re-review for 
big grant holders 


The fundamental concern with a
second layer of review for holders
of more than US$1 million in
grants from the US National
Institutes of Health is that it takes
us further from a meritocracy
in which the best and highest-impact
science is supported
(Nature 489, 203; 2012).

Elite grant recipients have 
gone through reviewers’ scrutiny 
and proved their productivity 
many times. Reviewers and grant 
administrators already check 
for potential funding overlap, 
one of the issues the reforms 
are designed to address. Peer 
review may be imperfect, but it 
is done by the scientific leaders 
in the relevant field, and there 
is no reason to believe that the 
judgement of an advisory council 
is superior. 

The reluctance of most 
funding agencies to pay 
for research infrastructure 
necessitates either leveraging 
economies of scale from 
multiple grants or belonging 
to an academic institution that 
is wealthy enough to share 
the costs — a position not all 
researchers enjoy. The new 
policy would mainly affect 
the most productive principal 
investigators with multiple 
R01-type (individual project 
application) grants, widely 
considered to yield the most 
innovative research. 

Taxpayers have the right to 
make the scientific community 
accountable for improving 
health, but this extra scrutiny 
may compromise attainment of 
that goal. 

Hemant K. Roy Northwestern 
University, Evanston, Illinois, USA. 
h-roy@northwestern.edu 


EU scientific visa is a 
success in Ireland 


From experience in Ireland, I 
contend that Europe’s scientific 
visa and the EURAXESS 
services can solve most of the 


problems regarding entry of 
foreign scientists into European 
research institutions and 
companies (Nature 488, 557; 
2012). Fast-track immigration is 
an important consideration for 
internationally mobile scientists 
and helps to attract the best 
global talent to Europe. 

Ireland, like the United 
Kingdom and Denmark, was 
not obliged to introduce the 
scientific visa under the 2005 
European Union (EU) directive, 
but it was recognized that 
the visa would help to attract 
non-EU researchers to the 
country. The scheme, which 
offers a free and fast service, has 
been successfully operating in 
Ireland since 2007 and is open to 
universities and companies. 

Visas are issued rapidly and 
work permits are not required. 
Researchers’ families can 
accompany them immediately 
and use public schooling. 
Family members have access 
to the job market and can stay 
on to look for a job after the 
contract ends. 

More than 200 EURAXESS 
offices in 40 European 
countries, including Ireland, 
help foreign researchers 
to move across Europe 
by providing practical 
information. The EURAXESS
portal (http://ec.europa.eu/euraxess)
is a free advertising
forum for EU research 
positions. 

Conor O’Carroll Irish 
Universities Association, Dublin, 
Ireland. conor.ocarroll@iua.ie 


Who discovered 
Universe expansion? 


Controversy persists over who 
first found that the Universe 
is expanding. Last year, Mario 
Livio quashed suggestions 
that Georges Lemaitre’s 1927 
theoretical prediction of 
expansion was deliberately 
suppressed (Nature 479, 171- 
173; 2011). Since then, another 
contender has emerged. 

The joint NASA and Infrared
Processing and Analysis Center
Extragalactic Database of 
Galaxy Distances, in Pasadena, 
California, which I co-lead, has 
tabulated and made public the 
historical distance estimates 
published by Edwin Hubble 
and his contemporaries to 
prove expansion (see I. Steer
J. R. Astron. Soc. Can. 105,
18-20; 2011). These reveal that 
measurements by a Swedish 
astronomer, Knut Lundmark, 
were much more advanced than 
formerly appreciated. 

Lundmark was the first 
person to find observational 
evidence for expansion, in 1924 
— three years before Lemaitre 
and five years before Hubble. 
Lundmark’s extragalactic 
distance estimates were far 
more accurate than Hubble's, 
consistent with an expansion 
rate (Hubble constant) that 
was within 1% of the best 
measurements today. 

However, Lundmark’s 
research was not adopted 
because it relied on one 
unproven method (galaxy 
diameters), cross-checked 
with one unproven distance to 
the Andromeda galaxy, which 
was derived from a type Ia 
supernova observed in 1885 
and mistaken for a normal nova 
(W. Huggins and W. F. Denning 
Nature 32, 465-466; 1885). 

Hubble's research in 1929 
yielded a value for the Hubble 
constant that was inaccurate by 
almost an order of magnitude. 
It was adopted because it was 
derived from multiple methods 
— including one still in use 
(brightest stars) — and was 
cross-checked with multiple 
galaxies with distances based on 
proven Cepheid star variables. 

Lundmark established 
observational evidence that 
the Universe is expanding. 
Lemaitre established theoretical 
evidence. Hubble established 
observational proof. 

Ian Steer NASA/IPAC 
Extragalactic Database of 
Galaxy Distances, Pasadena, 
California, USA. 
ian@colosseum.com 


Environment in 
Queensland at risk 


In the state of Queensland, 
Australia, hard-won 
environmental protections are 
under threat. 

In April this year, 
Queensland elected a new 
government that is pro- 
development and pro-mining. 
These activities have been 
burgeoning over the past few 
years, prompting the United 
Nations Educational, Scientific 
and Cultural Organization 
(UNESCO) to investigate 
whether the Great Barrier Reef 
should be included on the List 
of World Heritage in Danger 
(see http://whc.unesco.org/en/documents/117104).

The government is also 
moving to dismantle the 
Wild Rivers Act, which was 
designed to protect and manage 
Queensland’s last remaining 
pristine rivers and catchments 
in Cape York. These rivers 
have the richest diversity of 
freshwater fish in Australia, 
including the sawfish Pristis 
microdon and the speartooth 
shark Glyphis glyphis. These are 
considered by the International 
Union for Conservation 
of Nature to be critically 
endangered and endangered, 
respectively. Australia is one of 
the last countries in the world 
where populations of these fish 
are viable. 

Ministers have nevertheless 
declared that a regional plan for 
Cape York will “unashamedly” 
fast-track development and 
“absolutely” secure a place 
for mining in these areas (see 
go.nature.com/e6hbzc). 

These threats to Australia’s 
aquatic environment are being 
compounded by government 
cuts to staff in environmental 
and fisheries management 
across public institutions. 
Andrew Chin, Jimmy White 
James Cook University, 
Townsville, Australia. 
andrew.chin@jcu.edu.au 
Nick Dulvy Simon Fraser 
University, Burnaby, Canada. 


SCIENCE FICTION


TEMPORAL VENTURES ROBBED ME 


BY SCOTT C. MIKULA 
Posted: Nov. 17, 2029 8:03 PM EST 


Note: Since TV DELETED my review
on their website, I am reposting this on
rateyourtrip.com where they can't touch it. It 
says something that they felt like they had to 
CENSOR me, don't you think? 


Don't let Temporal Ventures lure you in 
with their false advertising! They're just 
trying to cheat you out of your money, 
or if you're really lucky maybe they'll
thump you on the head and pry it from
your fingers in the name of ‘science’ or
‘intertemporal law’. But I’m getting ahead
of myself. 

I enrolled in Temporal Venture’s ‘Classic 
Caribbean’ package. Take a moment to see 
what the trip to the year 1720 is SUPPOSED 
to include: http://temporalventures.com/packages?p=39284.
Treasure! Pirates!
Adventure! Sound exciting? Let me tell 
you how the ‘adventure’ actually goes. 

First off, the time jump makes you 
feel like SHIT, like after a night of 
drinking with my ex-wife. And 
when you finally crawl out of bed 
do they let you enjoy the sun? No, 
they have a two-hour orienta- 
tion video. Maybe you'll make it 
past the first mention of tempo- 
ral nodes or causality thresholds, 
but I didn’t. Yawn. I didn’t pay for 
a science lesson. 

Then you meet wet-blanket 
Randy. He’s the tour-guide fun
police for the trip. You know the
type: little guy, glasses, with a stuck
up, I’m-smarter-than-you attitude,
always monotoning about what you 
should or shouldn't be doing. I had to 
spend most of my morning tuning him 
out. When I asked about treasure hunting, 
stick-up-his-butt Randy made some excuse 
about liability and pointed me to the ‘pirate’s
wreck’ tour — some old ship that got caught
up on a reef and I guarantee you doesn't 
have an ounce of treasure on it by now. 
Meanwhile all the other guests are content 
to lay on the beach or sit at the bar and blab 
about how clear the sky is and that the air 

smells so fresh. Whatever. Come home with a
load of Spanish treasure and you can invest in a
personal holodome and next-gen atmo-filter.
You'll be able to lay on a beach whenever you want.

Wish you were here?

NATURE.COM
Follow Futures on Facebook at:
go.nature.com/mtoodm

Oh, and pirates? There was ONE pirate. 
Greasy Sid had figured out how to panhandle 
from the tourists — which goes to show that
bums will be bums no matter what century
it is — and I guess some intertemporal treaty
says they're not legally able to run him off. I 
think he had tuberculosis or something, but 
I talked to him anyway. Well, with the right 
incentive (hint: pirates like Bacardi Gold way 
better than whatever booze they’re used to) 
HE, at least, was willing to talk about treas- 
ure. Even drew me a map on the back of a 
cocktail napkin. 

So I, er, ‘borrowed’ some scuba gear 
and a skimmer. And damned if Greasy Sid 
didn’t know what he was talking about. I 
had to swim through an underwater cave 
and almost got skewered by poisoned darts 
and I probably set off some ancient voodoo 
curse, but I found it. Treasure like you've
never seen! Chests and sacks of coins, heaps 
of gems, cups and plates of gold and silver, 
and enough jewellery to make my ex-wife's 
eyes pop. 

I tried to send my ex a picture with a little 
note about how she thought I'd never make 
anything of myself and as she was always 
knocking my ‘schemes’, she could just say
goodbye to seeing any part of the haul, but 
— you guessed it, my cell showed no signal. 
Turns out they don’t have coverage ANY- 
WHERE back then. Something about trans- 
mitting across the time link, I don't know. I 
mean, they're scientists, for god’s sake! They 
should be able to figure something out. 

Anyway, I bundle up what I can carry, 
head back to the resort, and guess who meets 
me dockside? Sonovabitch Randy, with two 
of his security goons flanking him. “You 
know you can't take any of that home with 
you,” he says, all smug, and then he
blabs on about disrupting the
time flow and I point out
that he told us a dozen times
that nothing we do here can
mess up the future, and he
says: “That’s true, as long as
you don’t take anything FROM
the past TO the future. Then all
bets are off.”

Which you’d THINK someone
would have mentioned if it
was such a big deal. But it’s just
a bunch of bullshit that he made
up so he could ‘confiscate’ *cough* 
STEAL *cough* my treasure. So 
let’s review. Temporal Ventures 
robbed me of my rightfully obtained 
property, manhandled me, searched 
my person — they even stole the 
doubloon I'd stuffed into my unmentionables
because I KNEW they’d pull
some crap like that — and then they had 
security glued to me for the rest of the trip 
so I couldn't even take a piss without one of 
them watching over my shoulder. 

Well, if THAT’S what you want from your 
‘vacation’ then go right ahead with Tempo- 
ral Ventures. They've lost my business. I was 
looking forward to checking out their ‘Jurassic
Jaunt’ package, but screw that. I bet they don't
even let you ride the dinosaurs.


Scott C. Mikula is a husband, father, 
software developer, board gamer, and has 
been told he makes a mean chocolate-chip 
cookie. On his better days he is also a writer. 
TECHNOLOGY FEATURE


CHARTING THE 
BRAIN’S NETWORKS


The field of connectomics is pulling neuroscience into a speedy, 
high-throughput lane that is generating vast amounts of data. 




BY VIVIEN MARX 


Researchers seeking to understand
the brain want big data. And they are
getting them. Just as geneticists have 
moved from genes to genomes to the inter- 
acting network of factors that regulate and 
modify the genome, neuroscientists are going 
from studying single neurons to tracing how 
vast neuronal networks connect and interact. 
“I think this is a really exciting field,” says
neuroscientist Moritz Helmstaedter at the Max
Planck Institute for Neurobiology in Martinsried,
Germany, who is working to obtain a
cell-level overview of the neuronal connections
— the connectome — of the mammalian
cortex. “Many people are pretty ambitious
about breaking the next barrier in understanding
how the brain works by using this new field
of connectomics.”

Massive stores of brain-tissue slides are providing a resource for scientists working on mapping neural networks.

Sceptics argue that current methods lack the 
power to map the massively interconnected
web of around 100 billion neurons in the
human brain. Even if technology can rise 
to the challenge, they say, it is impossible to 
decipher so much data. 

Clay Reid, a neuroscientist at Harvard 
Medical School in Boston, Massachusetts, 
and recently appointed as a senior investiga- 
tor at the Allen Institute for Brain Science in 
Seattle, Washington, counters detractors by 
pointing to recent progress in neuroscience. 
A few years ago, it was nearly impossible
to collect data on networks of neurons. “It’s
not routine now but it’s easier,” he says. And, he
adds, even without a map of the entire brain,
charting just a fraction of a neural circuit is an
important advance.

11 OCTOBER 2012 | VOL 490 | NATURE | 293

© 2012 Macmillan Publishers Limited. All rights reserved

The way the neuroscience community is 
embracing connectomics and its big-data 
harvest is part of a technical and cultural shift. 
“We're moving away from artisan cottage-industry
science and going after bigger, harder,
more complex problems,” says geneticist Geoffrey
Duyk, who is partner and managing director
at the venture investment company TPG
Biotech in San Francisco, California. 

Advances in high-throughput technology 
and automation are giving neuroscientists 
both speed and scale. By combining whole- 
brain magnetic resonance imaging and 
computational methods, some teams are map- 
ping the major routes of traffic in the human 
brain. Other researchers are concentrating on 
animal models to chart the brain’s neuronal 
circuitry on a much smaller scale, at the level 
of individual neurons and their projections of 
axons, dendrites and synaptic connections. 
The goal is to add the information gleaned to 
the animal and human connectomes, build- 
ing them up as resources for understanding 
behaviour in health and disease. 


SCALING UP 
Studying the neuronal web at every possible 
scale demands the integration of every avail- 
able method. Old-fashioned staining tech- 
niques are used alongside new methods of 
tissue preparation and methods taken from 
genomics (the study 
of genomes), and 
traditional and new 
approaches are used 
in microscopy and 
image analysis. Auto- 
mation pipelines and 
computational meth- 
ods are essential to 
handling the data 
— but so is skilled, 
manual artistry. 


The brain is the only organ for which the
number and types of cells it contains has not
been determined. Just being able to differentiate
these cells from one another under a microscope
is an important advance. Jeff Lichtman and his
colleagues at Harvard University’s Center for 
Brain Science in Cambridge, Massachusetts, 
apply light and electron microscopy to study 
how neural circuits change over the course 
of development. Using a genomic technique 
called Brainbow, the group was able to make 
specific DNA modifications in transgenic 
mice to label neurons fluorescently in more
than 90 colours. The researchers could then
distinguish individual neurons in the brain’s
dense tangles of otherwise identical neurons.
Separately, the Brainstorm Consortium, which
is composed of scientists from Harvard and the
Massachusetts Institute of Technology (MIT),
in Cambridge, Massachusetts, and from Stanford
University, California, is also working on
new methods for application in connectomics,
in areas such as brain-tissue preparation, imaging
and image analysis.

HIGH-THROUGHPUT ANATOMY
Retinal neurons are traced to build a connectome.

Because sample preparation in neuroscience 
is labour and time intensive, several groups 
are working on improving it. Lichtman’s team 
has developed the automatic tape-collection 
mechanism for ultramicrotomes (ATM), 
which automatically sections brain tissue and 
collects thousands of slices on a moving tape. 
The slices can then be stained and imaged 
on a scanning electron microscope. “It vastly 
improves sectioning methodology,” says neu- 
roscientist Scott Emmons, who has just set 
up an ATM in his lab at the Albert Einstein 
College of Medicine in New York. Classically, 
Emmons explains, sections are floated on 
water and then gathered manually. “They got 
lost, they get scrambled up, you cannot make 
long series,” he says. 

At the moment, imaging and image analy- 
sis can be only partially automated. But, 
for connectomics researchers to obtain the 
massive amounts of imaging data they want, 
technology will need to be expanded. Licht- 
man and Winfried Denk of the Max Planck 
Institute for Medical Research in Heidelberg, 
Germany, note that this approach will neces- 
sitate “the automation, even industrialization 
of imaging”. 

High-throughput neuroscience reaches 
industrial scale at the Allen Institute for Brain 
Science. When a project is explored for fea- 
sibility, the institute looks for a ‘brute force’ 
method to convert it to managed, pipelined 


programmes with milestones and schedules, 
says Amy Bernard, director of structured 
science at the institute. This approach has 
been adapted from industry, says Chinh Dang, 
the institute’s chief technology officer (see 
‘Neuroscience goes industrial’). 

Large-scale efforts at a number of other 
labs take on circuits in big ways. At Harvard 
Medical School, Reid and his colleagues have 
been mapping and studying neural connec- 
tions in the mouse visual cortex. To do this, 
they first looked at a brain region in vivo, using 
two-photon microscopy — a technique that is 
good for imaging live tissue — and indicator 
molecules that show when neurons fire and 
release calcium. They then captured images 
from fixed tissue with a custom-built serial 
section transmission electron microscope 
camera array.

The team captured more than 3 million 
digital images of the mouse cortex and, in 
collaboration with the Pittsburgh Supercom- 
puting Center in Pennsylvania, stitched them 
together into 1,200 montages comprising 
10 billion pixels apiece, and aligned them in 
three dimensions. The computing support was 
key. “We would not have been able to do the 
work that we did last year without the help of 
that team,” says Reid. 
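
The scale of that stitching job follows from simple arithmetic. The sketch below multiplies the figures quoted above; the one-byte-per-pixel storage cost is an assumption made for illustration (8-bit greyscale, uncompressed), not a figure reported by Reid's team.

```python
# Back-of-envelope size of the stitched mouse-cortex data set.
MONTAGES = 1_200                      # montages reported in the text
PIXELS_PER_MONTAGE = 10_000_000_000   # 10 billion pixels apiece
BYTES_PER_PIXEL = 1                   # ASSUMPTION: 8-bit greyscale, uncompressed

total_pixels = MONTAGES * PIXELS_PER_MONTAGE
total_terabytes = total_pixels * BYTES_PER_PIXEL / 1e12

print(f"{total_pixels:.2e} pixels, about {total_terabytes:.0f} TB uncompressed")
```

Under that assumption the montages alone come to roughly 12 terabytes, which helps to explain why supercomputing support was needed.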

Denk, Helmstaedter and Kevin Briggman 
from the National Institute of Neurological 
Disorders and Stroke in Bethesda, Maryland, 
used a labour-intensive approach in their 
recent study of a mouse retinal circuit. They, 
too, used two-photon imaging with calcium 
indicators to capture brain functional patterns 
in vivo and then imaged fixed tissue to obtain 
ultrastructure information.

But for that last step they used serial block- 
face scanning electron microscopy, an auto- 
mated technique in which a block of tissue 
is imaged and then the top slice is shaved off 
to image the next slice. With their own visu- 
alization-annotation software tools, such as 
KNOSSOS (www.knossostool.org), they anno- 
tated the images to show how neurons branch 
and interconnect. 


REACHING OUT TO CROWDS 

Both teams performed precise segmenta- 
tion on the images they obtained: tracing 
the branching structure of neurons and the 
synapses between them. This approach has 
revealed links between structure and function 
— illuminating, for example, how the many 
types of neurons in the retinal circuit com- 
pute visual signals. “The way we got there was 
entirely segmentation by hand,” says Reid, who
says that the human eye remains best suited for 
the task, given our innate skill at recognizing 
patterns. 

Ultimately, however, researchers such as 
Sebastian Seung at MIT and Dmitri Chklovskii 
at the Howard Hughes Medical Institute's Janelia Farm
research campus in Ashburn, Virginia, hope to 
teach machines how to perform segmentation.
“Image processing is currently the bottleneck,”
says Viren Jain, a neuroscientist at Janelia
Farm.

A scientist subdivides a brain segment by hand, ready for it to be cut and mounted on slides.

One component of that bottleneck is the fact 
that circuits can be metres long. Contouring 
a path just one-third of a metre long would 
take a human annotator some 60,000 hours, 
or around 30 years assuming a normal work 
week. However, if manual reconstruction has 
its challenges, so too does automation, note 
Denk, Briggman and Helmstaedter. Speeding
up image analysis is hindered by stains that
can emphasize the cell surface but obscure the
visibility of synapses. And tracing a circuit is
dogged by errors even with experts doing the
work, which makes it hard to imagine how a
computer program could do as well or better. 
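
The 30-year figure can be checked with a few lines of arithmetic. In this sketch the 40-hour week and 50 working weeks per year are assumptions, because the text says only "a normal work week"; the per-metre figure is a simple linear extrapolation.

```python
def annotation_years(hours, hours_per_week=40, weeks_per_year=50):
    """Convert a tracing workload in hours into working years."""
    return hours / (hours_per_week * weeks_per_year)

HOURS_FOR_THIRD_OF_METRE = 60_000   # figure quoted in the text
years = annotation_years(HOURS_FOR_THIRD_OF_METRE)
hours_per_metre = HOURS_FOR_THIRD_OF_METRE * 3  # linear extrapolation (assumption)

print(f"{years:.0f} working years; roughly {hours_per_metre:,} hours per metre")
```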

Ultimately, new tools are needed to deter- 
mine the ‘volume’ of neurons — that is, when 
different pathways touch each other and may 
connect. In the meantime, Reid’s Harvard team 
continues to collect electron microscopy data 
and analyse them using open-source tools such 
as TrakEM2 (go.nature.com/kgsalt) and the 
Collaborative Annotation Toolkit for Massive 
Amounts of Image Data (CATMAID) (www. 
catmaid.org). “Once the computers take over,” 
he says, wiring diagrams will capture more 
connections in a circuit. “All of us are con- 
vinced that we can get tens of thousands or 


maybe hundreds of thousands of connections, 
and then it will really become circuit science.” 

Until then, one way of speeding up segmentation
is to use crowds to sift through the
data and then reconcile the many results. The
software for tracing and annotating electron
micrographs initially ran mainly on expensive
lab computers, which limited access, says
Helmstaedter. But in 2006, he and his colleagues
converted KNOSSOS into distributable
software and began recruiting students, of
whom they now have about 200, to annotate
electron micrographs at home or in the dorm.
“This particular combination of being able to
browse these extremely large data sets and still
do it on a laptop is really so far unique,” he says.

The team uses different software, the 
redundant-skeleton consensus procedure 
(RESCOP), to reconcile the many annotations
generated by the students, which overlap in 
some ways and conflict in others. 
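
The general idea of reconciling redundant annotations can be illustrated with a simple majority vote. The sketch below is not the RESCOP procedure itself, whose details are not given here; the edge representation, the function name `consensus_edges` and the `min_votes` threshold are all hypothetical choices made for illustration.

```python
from collections import Counter

def consensus_edges(annotations, min_votes):
    """Keep skeleton edges traced by at least `min_votes` annotators.

    Each annotation is a set of edges; an edge is a pair of node labels.
    """
    votes = Counter()
    for edges in annotations:
        # frozenset makes (a, b) and (b, a) count as the same edge
        votes.update({frozenset(edge) for edge in edges})
    return {tuple(sorted(edge)) for edge, n in votes.items() if n >= min_votes}

# Three hypothetical annotators trace the same neurite and disagree on one branch.
annotators = [
    {("a", "b"), ("b", "c"), ("c", "d")},
    {("a", "b"), ("c", "b"), ("c", "e")},
    {("a", "b"), ("b", "c"), ("c", "d")},
]
agreed = consensus_edges(annotators, min_votes=2)
print(sorted(agreed))
```

Here the branch traced by only one annotator is dropped, while edges that at least two annotators agree on are kept.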

Helmstaedter and his colleagues are now 
looking to expand the crowd working on their 
data to include members of the public. To 
entice participation, the team has hired gam- 
ing programmers to add an element of play. 
“The idea is basically to fly through the brain,
it has the feel of a flight simulator,” he says of
the prototype. Similarly, scientists at MIT have
launched Wired Differently (www.wireddifferently.org)
to engage crowds in neuron tracing.

In the long term, Helmstaedter hopes that 
automation will become so good that people 
are only needed for annotating complex brain 
regions. Having just launched his own lab, 
he is gearing up to pursue his goal of a cell- 
level connectome of the mouse cortex, which 
will involve tracing the connections between 
billions of neurons. “That is really an issue 
of scale,” he says. “We need a factor of 100 in
terms of annotation efficiency.” 


SMALL LABS THINK BIG 

High-throughput neuroanatomy might seem 
to be for large labs only, but the methods they 
use will enable smaller ones to follow in their 
path, Helmstaedter says. A few smaller labs are 
already scaling up, using new methods to pull 
more data out of optical and electron micros- 
copy images than was possible a few years ago. 

In 2004, fresh from his postdoc at the Uni- 
versity of Southern California in Los Angeles, 
where he studied connectivity in the rat brain, 
neuroscientist Hong-Wei Dong was among
the first scientists to be recruited to the Allen 
Institute to create the Allen Brain Atlas, a map 
of gene expression for the entire mouse brain. 

“I never thought science can be done on this
kind of scale,” says Dong, who has now left the 
Allen Institute to start his own lab at the Uni- 
versity of California, Los Angeles (UCLA). “If 
I didn't have that kind of experience, I prob- 
ably would have never thought of mapping the 
connectivity for the entire brain.” At UCLA he
launched iConnectome, a large-scale optical- 
imaging project, the aim of which is to create a 
three-dimensional connectome of the mouse 
brain. He uses optical microscopy and fluores- 
cent markers to capture pathway information; 
the images are on a coarser scale than with 
electron microscopy but they show how brain 
regions interact. 

Dong’s methods include classic neuroana- 
tomical techniques of surgery, preparing and 
mounting tissue on slides, using tracers to 
show neuronal inputs and outputs, and clas- 
sic staining techniques such as Nissl staining, 
which shows the architecture of brain regions. 
“That is all manual labour,” he says. 

In neuroanatomy, tracers are usually injected 
into a brain region one at a time. In Dong’s pro- 
ject, animals receive two injections in two sites, 
allowing scientists to examine input and output 
pathways concurrently, and yielding four times 
more data than in studies with single tracers. It
also cuts down on the cost, time and number 
of animals needed, he says. 

In the decade since his postdoc, Dong 
notes that scanning tissue slides has sped up 
dramatically, a trend he expects to continue. 
Microscopes can automatically scan slides 
one by one, each of which holds 50-100 tissue 
sections. Olympus, headquartered in Tokyo, 
Hamamatsu in Hamamatsu City, Japan, and 
Aperio, which was recently acquired by Leica 
Biosystems in Nussloch, Germany, all make 
high-throughput slide scanners. But speed is 
not everything, because analysis must be done 
with care. For example, Dong says that minor 
wrong-way tracer transport can lead to false 
conclusions. 

And not all image-processing steps gain 
speed through automation. For example, 
tissue slices are registered automatically to 
a corresponding image in the Allen Reference
Brain Atlas, but contrast and brightness
require manual adjustment.

Tracers are used to map neural networks.

Even the process of automatic registration 
can be difficult, because slices differ slightly 
from one another, causing distortion. To match 
each image to the reference, Dong’s team keeps 
enhancing the registration algorithm. “That is 
a difficult part, but a key part for the future 
if we want to evolve high-throughput analysis
of the data,” he says. It is also not easy to keep
good computing staff, who often give in to the
lure of Silicon Valley.

The three-dimensional visualization in
iConnectome allows users to navigate through
the brain’s pathways, and layers images of
tracer-labelled neurons against the backdrop
of the atlas reference images, giving them a
geographical context. “We put this map behind
each individual brain section, and as people
change the opacity, the atlas can give them a
reference,” Dong says. The combination of
high-throughput approaches and online presentation
enables brain-structure information
to be shared beyond neuroanatomy circles, he
says, to researchers focusing on physiology or
behaviour.

While Dong finds ways to offer data-laden
images to others, in his garage sit dozens of
boxes of his own slides — around 6,000 of
them, containing a total of 60,000 individual
tissue slices from 200 rat brains. He hopes to
image them so that he can share his work on
the stria terminalis brain region with others.


Neuroscience goes industrial


The Allen Institute for Brain Science in
Seattle, Washington, runs efforts to create
and offer online public resources about
the brain, including atlases of the human,
mouse and developing brains, as well as a
mouse brain connectivity atlas. One of its
major goals has been to combine studies
of neuroanatomy and gene expression into
“an integrated means of understanding the
brain”, says Amy Bernard, director of
structured science at the institute. And that
means amassing data. Now the institute is
moving into broader areas, going from
genes to circuits to behaviour, which means
more data still.

Every week, using automated slide assembly,
up to 16,000 brain sections are mounted on
slides. The slides then go through in situ
hybridization — a method used for
gene-expression analysis — which is mostly
automated, with six robots handling more
than 190 slides per run, and two automated
slide coverslippers working five slides a minute.

According to a recent tally, the institute
has 1.7 million brain-tissue sections, on
almost a million slides, each of which is
stored both digitally and physically in filing
cabinets stacked 10 drawers high throughout
the microscopy suite and in off-site storage.
Stacked one on top of another, the piles of
slides would be 68 kilometres high.

“We keep everything, we’re extremely
conservative,” because researchers might
need to return to the data or the samples,
says Bernard. All slides are kept, as are all
raw data, and all images are saved and
backed up. As in smaller neuroscience
labs, the institute begins with a brain that
is prepared for the microscope using
classic neuroanatomy techniques such as
histological staining. There is “nothing new or
fancy there”, she says, but doing it in “huge
volumes” is new.

The production pipeline includes cutting
tissue; mounting the samples on slides;
performing in situ hybridization, staining,
or both; scanning the images; and saving
the data into an informatics pipeline, all
automated steps.

Compiling their atlas of gene expression
in the mouse brain generated 600 terabytes
of data in four years. “This year alone we’re
generating 1.7 petabytes of data,” says Chinh
Dang, the institute’s chief technology officer.
Information flow through the
pipeline of processing stages
is automated, as at that scale
“there is no way that anyone
can manually manage that
kind of data”, she says.

The teams also work with
instrument manufacturers
to adapt technology. “We're
really tinkerers,” says Bernard.
For example, they engineered
a faster slide-feed cassette for
their microscopes. “You can
put in 100 slides, and it will
automatically load them onto
the stage of the microscope
and then scoot them off
the stage when it is done
scanning,” she says.

Data infrastructure is built
from the beginning. “We don’t wait until the
science has been done and then people
start to look at the data and then build tools
around the data,” says Dang. The institute
uses open-source databases and tools,
and this summer invited academics and
independent programmers to a ‘hackathon’,
in which participants had access to the
institute’s application-programming interface
to develop new data-analysis tools.

The use of robots speeds up time-honoured neuroanatomy techniques.
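
The data volumes quoted in the box ‘Neuroscience goes industrial’ can be put on a common per-year scale. This is a rough sketch that uses only the figures given there; decimal units (1 petabyte = 1,000 terabytes) are an assumption.

```python
TB_PER_PB = 1_000                # ASSUMPTION: decimal units

atlas_tb, atlas_years = 600, 4   # mouse gene-expression atlas
current_pb_per_year = 1.7        # "this year alone"

atlas_tb_per_year = atlas_tb / atlas_years
growth = current_pb_per_year * TB_PER_PB / atlas_tb_per_year

print(f"Atlas: {atlas_tb_per_year:.0f} TB/year; current rate is about {growth:.1f}x higher")
```

On these numbers the institute's yearly output has grown roughly elevenfold compared with the average rate during the atlas project.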

Brendan Brinkman, senior product man- 
ager of the Olympus America Scientific 
Equipment Group, based in Center Valley, 
Pennsylvania, has worked with Dong and 
other neuroanatomists who use slide-scanning 
and point-scanning confocal technology. To 
hasten data capture in microscopy further, the 
company is expanding its multi-point mapping 
software, Brinkman says. In scanning confocal 
imaging, a raster scan moves across a sample. 
To focus only on differences in fluorescence, 
the scanner can jump from one area to another 
and capture bursts of data. “You can adjust the 
scanning path to make it as fast as possible,” he 
says. Olympus has also tailored the software 
to let scientists capture multiple-channel fluo- 
rescent signals. “Certainly, neurobiology is the 
key group for this kind of technology,” he says.

And connectome projects are increasing 
sales of slide scanners, as researchers seek the 
quick generation of data from large sample 
sets. TPG Biotech’s Duyk agrees that high- 
throughput approaches to neuroanatomy 
could be a commercial boon. “It certainly 
creates opportunities for the life-science tools 
companies to push the cutting edge of their 
technology,” he says.

For those lacking these tools, services are 
emerging. Earlier this year, for example, 
Renovo Neural, a spin-out from the Cleveland 
Clinic in Ohio, launched an electron micros- 
copy service. Customers deliver samples, 
which the company sections with an ultrami- 
crotome from Gatan in Pleasanton, California, 
then images with an automated serial block- 
face scanning microscope from Zeiss in Jena, 
Germany, to return hundreds of ultrastructure 
images. 


NETWORK SUCCESS 

The first complete connectome was obtained 
for the worm Caenorhabditis elegans in 1986. 
Sydney Brenner and his colleagues at the MRC 
Laboratory of Molecular Biology in Cam- 
bridge, UK, completed their wiring diagram 
of the hermaphrodite worm nervous system by 
tracing images of neurons on electron micrographs
by hand. The hermaphrodite is one of
the animal's two genders; but the male worm 
has proven to be tougher to pin down, neu- 
ronally speaking. 

Teams tried to map the male’s nervous 
system as Brenner’s group had for the her- 
maphrodite, but assembling its branching 
structure by hand was too hard. “The field 
of connectomics kind of got stuck,” Emmons 
says. Electron microscopist David Hall at the 
Albert Einstein College of Medicine is the 
custodian of material from the Brenner lab: 
notebooks, embedded tissue blocks, thin sec- 
tions, negatives and the manually annotated 
C. elegans electron micrographs. This year, he, 
Emmons and colleagues revisited 5,000 of the 
historic images, analysing them in a new way.


They translated their analysis of the micro- 
graphs into a map of all the connections and 
their strengths in the male C. elegans posterior 
nervous system. Of the 170 neurons they stud- 
ied, 144 were involved in the circuit controlling 
mating behaviour, allowing the team to make a 
link between connectivity at the cellular level 
and behaviour. 

Previous researchers had counted synapses 
but had not included synapse size, Emmons 
says, which the team now did. Then they 
applied a mathematical model to use size as a 
proxy for the functional strength of each neural 
connection. The result is a map of the neural 
network’s connec- 
tivity that includes 
quantitative cell- 
biology information. 
Across the connec- 
tome, they found that 
interaction strengths 
between neurons 
varied more than 
100-fold. 
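
One way to picture such a map is as a weighted graph in which the weight of each connection is the summed size of its synapses. The sketch below uses made-up synapse records: the neuron names and sizes are hypothetical, and summed size stands in for the team's mathematical model, which is not specified here.

```python
from collections import defaultdict

# (presynaptic neuron, postsynaptic neuron, synapse size in arbitrary units)
# HYPOTHETICAL records for illustration only.
synapses = [
    ("N1", "N2", 4.0), ("N1", "N2", 6.0),
    ("N2", "N3", 0.5),
    ("N3", "N1", 50.0),
]

def connection_strengths(records):
    """Sum synapse sizes per directed neuron pair as a proxy for strength."""
    strengths = defaultdict(float)
    for pre, post, size in records:
        strengths[(pre, post)] += size
    return dict(strengths)

strengths = connection_strengths(synapses)
spread = max(strengths.values()) / min(strengths.values())
print(strengths)
print(f"strongest/weakest = {spread:.0f}-fold")
```

Counting synapses alone would treat the N1 to N2 connection (two synapses) as the strongest; weighting by size changes the ranking, which is the point of including size in the map.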

Using the existing maps from Brenner’s
lab made neuron tracing easier.
“Somebody has already gone before
you and put coloured numbers on them,”
Emmons says. Elegance (go.nature.com/nvsfnn),
the group's software tool, accelerated
neuron tracing by translating mouse clicks into 
map coordinates as the team scrutinized the 
digitized micrographs. 

Emmons thinks that his methods can speed 
up connectomics efforts in other small organ- 
isms. His group will expand its range in sev- 
eral ways; for example, by using their new 
ATM machine to automatically collect a series 
of slices for later imaging. Ultimately, says 
Emmons, using the ATM, “we're hoping for 
a 20,000 unbroken series, which would cover 
an entire worm, which has never been done 
before”. Technology like this gives a small lab 
the tools to tackle large projects such as finishing
the male worm’s connectome, comparing it
to the hermaphrodite or mapping the develop- 
ing nervous system. The C. elegans community 
is “back in the connectomics business”, he says.

Emmons will move on to analyse synaptic 
connections in the mouse brain, although, for 
now, his focus is on the worm. The ATM will 
deliver brain sections aplenty for imaging, but 
he is confident that his new scanning electron 
microscope with its bigger field of view is up to 
the task. “With one shot we can cover a whole 
worm cross-section,” he says.

Applying these techniques to mamma- 
lian brains takes more than automation. The 
human brain has over 80 billion neurons and 
the mouse brain has around 70 million. And 
both have more densely woven webs of neurons
than in the worm. “So you can’t just scale
up from a little lab and make a big lab and do
it,” Emmons says.

Big labs mapping large circuits on the 
single-neuron level are trying new scale-up 
approaches. Deciphering mammalian neu- 
ral circuits is Reid’s goal in his new position 
as senior investigator at the Allen Institute, to 
which he was appointed as part of the institute's 
ten-year US$300-million move to map con- 
nectomes and to use them to reach a broader 
understanding of brain function that integrates 
genes, circuits and behaviour. 

This project, called MindScope, is an 
attempt to go beyond the anatomy and wiring 
of the brain to how things are computed in the 
cortex, by having scientists work side by side 
to study cell types, neural coding, modelling 
analysis and theory. “It's a dream come true,” 
says Reid. 

He will use transgenic mice to identify the 
different cell types in the cortex and thalamus, 
and will then focus on deciphering the neural 
coding in the visual parts of the brain using a 
combination of techniques: behavioural analy- 
sis, physiology, imaging with calcium indica- 
tors and electron microscopy. The results will 
be compiled into what he calls network anat- 
omy, which is a wiring diagram with informa- 
tion piled onto it to map and understand the 
connectome’s dizzying array of functionalities.

As the wealth of data from using different 
imaging modalities and from integrated large- 
scale projects comes in and is collected and 
annotated, labs large and small will still need to 
put their heads and computing power together 
for data analysis. “Astronomical amounts of 
connectomics data are being generated at an 
exponential rate; extracting meaning from it 
is the bottleneck that hasn't been broken,” says 
Larry Swanson, neuroscientist at the University 
of Southern California and president-elect of the 
Society for Neuroscience in Washington DC. 


Vivien Marx is technology editor at Nature 
and Nature Methods. 
CORRECTION 

The article ‘Building better biobanks’ 
(Nature 486, 141-146; 2012) wrongly said 
that Freezerworks sells automated freezers. 
In fact, it makes data-management software 
for tracking samples held in such freezers. 
Promiscuous vesicles 


The unexpected finding that neurons can co-release two neurotransmitter molecules, dopamine and GABA, through a 
common mechanism provides a further advance in our understanding of the nervous system. SEE LETTER P.262 


JOHN T. WILLIAMS 


The striatum, a part of the brain that 
regulates motivation, reward responses, 
feeding and movement, integrates 
signals from other brain regions such as the 
substantia nigra and the ventral tegmental 
area. The inputs from these two regions arrive 
through nerve fibres that release dopamine, 
a neurotransmitter molecule that modulates 
neuronal activity by means of a slow-acting 
process. However, on page 262 of this issue, 
Tritsch et al. report that these modulatory 
neurons can also induce a rapid, short-lasting 
inhibition of striatal neurons by releasing 
another neurotransmitter, γ-aminobutyric 
acid (GABA). Surprisingly, the authors found 
that this GABA-mediated effect was depend- 
ent on the protein VMAT2, which is required 
for dopamine secretion, rather than on VGAT, 
a protein that was thought to be needed for 
GABA release. 

Proteins such as VMAT2 and VGAT are 
called vesicular transporters, because they 
pump neurotransmitters from the neuronal 
cytoplasm into vesicles that are then unloaded 
at the synaptic junction between a sender neu- 
ron and a receiver neuron (Fig. 1). The released 
neurotransmitters bind to specific receptor 
proteins on the surface of the receiver cell, 
leading to changes in the cell’s activity — by, 
for example, altering the flow of ions that cross 
the cell membrane through proteins called ion 
channels. Neurotransmitters can also bind to 
receptors on the sender cell (autoreceptors) 
that typically modulate the release or synthesis 
of the same neurotransmitter. 

Dopamine-releasing (dopaminergic) neu- 
rons that project from the substantia nigra 
and the ventral tegmental area (VTA) form 
a heterogeneous population; these cells dif- 
fer in the ion channels that they express on 
their surface, in their receptor proteins (such 
as G-protein-coupled receptors) and in their 
vesicular transporters. Some of the neurons 
can secrete glutamate (a neurotransmitter 
and a precursor in the synthesis of GABA), 
and dopamine and glutamate are stored and 
released from the same synaptic vesicles in 
a subset of dopaminergic neurons located 
mainly in the VTA. Glutamate transport into 
vesicles has been shown to be dependent on a 
specific vesicular transporter (VGLUT2). 


[Figure 1 labels: dopamine; GABA; VMAT2 (vesicular transporter); sender neuron; GABA receptor; receiver neuron; dopamine transporter.] 


Figure 1 | Two messages in one parcel. Information can flow from one brain area to another (for 
example, from the substantia nigra to the striatum) through neural fibres that end in synaptic connections 
between neurons. In the sender neuron, various proteins (vesicular transporters) pack neurotransmitter 
molecules such as dopamine into vesicles and, when the cell is activated, the synaptic vesicles are discharged 
into the space between the two neurons. The neurotransmitters then bind to receptor proteins on the surface 
of the second neuron, triggering changes in the cell’s activity. They can also be returned to the sender cell’s 
cytoplasm by specific transporter proteins. Tritsch et al. report that dopamine-releasing neurons projecting 
from the substantia nigra can use the vesicular transporter VMAT2 to store dopamine together with another 
neurotransmitter, γ-aminobutyric acid (GABA), in the same vesicles. As these compounds bind to receptors 
that have different effects on the receiver cell, their co-release allows dopamine-releasing neurons to 
modulate the activities of striatal neurons in various ways. 

To clarify how dopaminergic neurons in 
the substantia nigra affect striatal activity, 
Tritsch et al. used an optogenetic technique 
that allowed them to turn the neurons on 
and off in brain slices obtained from geneti- 
cally engineered mice. In this experimental 
set-up, brief pulses of light induced the neu- 
rons to secrete dopamine. Surprisingly, the 
authors found that this selective activation 
of dopaminergic neurons produced large, 
GABA-dependent inhibitory currents and 
small, glutamate-dependent excitatory cur- 
rents in certain striatal neurons. The GABA- 
dependent currents were large enough to 
inhibit the firing of the striatal neurons. Even 
more surprising was the finding that knocking 
out VGAT (the vesicular GABA transporter) 
did not block GABA’s inhibitory effect, raising 
the question of where the released GABA was 
coming from. The researchers went on to show 
that GABA’s inhibitory currents were blocked 
by inhibiting VMAT2, suggesting that GABA 
is concentrated with dopamine in the same 
synaptic vesicles. 

Further support for this conclusion came 
from a remarkable series of experiments. The 
authors found, for example, that decreas- 
ing the release of dopamine by inhibit- 
ing its synthesis, or by knocking out the 
expression of VGLUT2 or VGAT, did not 
affect the inhibition by GABA. By contrast, 
VMAT inhibitors blocked the secretion of 
both dopamine and GABA. Taken together, 
these assays indicate that VMAT2 alone is 
responsible for the accumulation of GABA in 
dopamine-containing vesicles. How GABA 
is synthesized in these neurons remains to be 
determined, because only a small subset of 
dopaminergic neurons seems to express 
the gene Gad65 (also known as Gad2), 
which encodes a key enzyme involved in the 
synthesis of the neurotransmitter. 

Tritsch and colleagues’ work shows, there- 
fore, that co-released dopamine and GABA 
modulate the activity of the striatal output 
neurons in temporally and mechanistically dis- 
tinct ways. It also suggests that, in the striatum, 
GABA-mediated inhibition dominates over 
glutamate-mediated activation. Moreover, the 
authors’ results indicate that GABA is probably 
co-released by other dopamine neurons, and 
so it will be important to determine the physi- 
ological impact of this process in the striatum 
and in other brain areas in vivo. 

Although the optogenetic technique was 
instrumental in identifying this process, the 
method is known to produce synchronous 
stimulation of many neurons, and so it is not 
clear whether GABA-dependent inhibition 
will be observed in the absence of this mas- 
sive stimulation. It is worth noting, neverthe- 
less, that dopaminergic neurons in subregions 
of the substantia nigra and the VTA tend to 
burst in relative synchrony as a result of a com- 
mon input. In addition, the fibres of neurons 
in these subregions tend to project to the same 
brain areas, and so it is possible that a phasic 
(intermittent) GABA-dependent inhibition 
occurs in selective areas. 

As the authors point out, VMAT is expressed 
not only by dopaminergic neurons but also 
by neurons that release other monoamines 
(neurotransmitters that are synthesized from 
aromatic amino acids), such as serotonin, 
adrenaline and noradrenaline. These neurons 
would therefore also be expected to release 
GABA. If true, then this property can be added 
to the many similarities among monoaminergic 
neurons — for example, in their firing rate and 
pattern, in the co-release of glutamate and 
monoamines, and in the expression of their 
respective inhibitory autoreceptors. Trans- 
mission from monoaminergic neurons can 
thus induce not only slow, long-lasting modu- 
lation but also brief, phasic inhibition. In my 
opinion, this finding will change the way 
researchers think about the roles of mono- 
aminergic neurons in the functioning of 
the brain. 


John T. Williams is at the Vollum Institute, 
Oregon Health and Sciences University, 
Portland, Oregon 97239, USA. 

e-mail: williamj@ohsu.edu 
Single molecules put a ring on it 


A variant of a classical reaction has been used to generate short-lived chemical 
species called arynes, allowing the one-step synthesis of structurally complex 
benzene derivatives from simple precursors. SEE ARTICLE P.208 


JOHN T. S. YEOMAN & SARAH E. REISMAN 


The reactions of arenes (benzene and its 
derivatives) have long been exploited 
by organic chemists for practical appli- 
cations ranging from the production of simple 
commodity chemicals, such as polymers and 
dyes, to the synthesis of pharmaceuticals and 
other structurally complex compounds. One 
approach to the synthesis of arenes involves the 
generation of benzyne, a highly reactive, short- 
lived intermediate that can undergo a variety of 
chemical transformations. On page 208 of this 
issue, Hoye and co-workers describe a clever 
strategy for the generation of arynes (ben- 
zyne-containing intermediates) that involves 
the heat-induced isomerization of a starting 
material, without the need for additional 
reagents. This transformation is complemen- 
tary to existing methods for aryne generation, 
and could expand the synthetic utility of these 
well-studied intermediates. 

Arynes have been of theoretical and prac- 
tical interest to chemists for the past century. 
However, it was not until 1953 that an isotopic- 
labelling study conducted by the US chemist 
Jack Roberts and co-workers provided the 
first compelling experimental evidence for the 
intermediacy of benzyne in a reaction. Organic 
chemists soon recognized the synthetic utility 
of arynes, and considerable effort has been 
devoted to the development of methods to 
generate these highly reactive species. 




Classically, arynes are generated from 
haloarenes — arenes in which one or more 
hydrogen atoms have been replaced by halogen 
atoms — using either strong bases or reactive 
organometallic reagents (Fig. 1). Although these 
methods work well for simple haloarenes, the 
use of a strong base (or a reactive organometallic 
reagent) can lead to unwanted side reactions 
when arenes containing certain chemical 
groups are employed. Alternatively, arynes can 
be generated under ‘milder’ conditions that are 
compatible with a wider range of groups by 
oxidizing compounds known as aminobenzo- 
triazoles, or by heating arenediazonium car- 
boxylate salts (Fig. 1). However, the former 
substrates can be difficult to synthesize, whereas 
the latter can explode when handled. 

A breakthrough for the practical application 
of arynes was the discovery that exposure of 
compounds known as aryl silyl triflates to fluo- 
ride salts produces arynes at room tempera- 
ture under nearly neutral conditions (Fig. 1). 
This mild reaction enables the use of complex 
aryne precursors and reaction partners, and 
has inspired a renaissance in aryne chemistry. 

The strategies highlighted above can all be 
considered elimination reactions, in which a 
precursor arene loses one or more groups to 
form an aryne. Hoye and colleagues’ approach, 
however, depends on a classic reaction known 
as the Diels-Alder cycloaddition, and there- 
fore represents a fundamentally different strat- 
egy. The Diels-Alder cycloaddition is the most 
Figure 1 | Methods for preparing arynes. Arynes are synthetically useful chemical intermediates 
that can be prepared from a variety of starting materials using the general reactions shown. The groups 
highlighted in red are eliminated to provide arynes when subjected to the reagents or conditions 
indicated. In each case, the starting material contains a benzene ring. R represents any chemical group; 
X, X₁ and X₂ can be any halogen atom; Me is a methyl group. The chemical bond extending from the 
centre of each benzene (or aryne) ring indicates that the bond could be attached to any of the carbon 
atoms on the ring that is not depicted as having a chemical group attached. 
Figure 2 | A fresh twist on the Diels-Alder reaction. a, In the classic Diels-Alder cycloaddition reaction, a diene (red) reacts with a dienophile (blue) to 
form a cyclic product. b, While attempting to prepare a ketotetrayne, Hoye et al. instead isolated a tricyclic arene. They propose that a diyne group (red) in the 
ketotetrayne undergoes a hexadehydro-Diels-Alder reaction with a diynophile group (blue) in the same molecule. The resulting compound can be thought of 
as a hybrid of two ‘resonance’ forms (double-headed arrow indicates resonance), one of which contains an aryne (purple). A pendant silyl ether group (orange) 
then reacts with the aryne to form the tricyclic arene, in which the aryne has become a benzene ring (green). Curly arrows indicate electron movement, and the 
dots on the oxygen atom in the silyl ether represent a pair of electrons. Me, methyl; tBu, tertiary butyl group, CMe₃. 


widely studied and used method for synthesiz- 
ing compounds that contain rings of six atoms 
(Fig. 2a). It involves the simultaneous formation 
of two carbon-carbon bonds between a diene 
(a molecule that contains four carbon atoms 
and two carbon-carbon double bonds) and a 
dienophile (which has two atoms and one dou- 
ble bond) to make a six-membered ring. The 
reaction works for a wide range of dienes and 
dienophiles, proceeds with predictable isomeric 
selectivity, and is highly atom-economical: all of 
the atoms present in the reactants appear in the 
product, minimizing waste. 

As is the case for many scientific advances, 
Hoye and colleagues’ discovery of their 
‘hexadehydro-Diels-Alder’ (HDDA) reac- 
tion occurred serendipitously, during their 
efforts to prepare a compound known as a 
ketotetrayne. Instead of the ketotetrayne, they 
obtained a tricyclic arene as the major product 
(Fig. 2b). They recognized that a cycloaddi- 
tion process was taking place: two carbon- 
carbon triple bonds (alkynes) were reacting 
with a third alkyne triple bond from the same 
molecule to generate an aryne fused to one 
other ring. In terminology analogous to the 
diene and dienophile of the classical Diels- 
Alder reaction, the first two alkynes comprise 
a diyne group, whereas the third alkyne reacts 
as a diynophile. The transiently formed aryne 
then reacted with a fortuitously positioned 
silyl ether group in the molecule to yield the 
tricyclic arene as the final product. 
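
A small atom-counting sketch may help to show why the classic reaction is called atom-economical and where the 'hexadehydro' prefix comes from. It is only an illustration under stated assumptions: the four parent hydrocarbons below (butadiene, ethene, butadiyne and ethyne) are chosen for simplicity and are not the substrates used by Hoye and colleagues, whose diyne and diynophile sit in the same molecule. The function name `combine` is hypothetical.

```python
from collections import Counter

def combine(*formulas):
    """Add up atom counts, as in a cycloaddition in which no atoms are lost."""
    total = Counter()
    for formula in formulas:
        total.update(formula)
    return total

# Parent hydrocarbons, assumed here purely for illustration.
butadiene = Counter({"C": 4, "H": 6})  # simplest diene
ethene = Counter({"C": 2, "H": 4})     # simplest dienophile
butadiyne = Counter({"C": 4, "H": 2})  # simplest diyne
ethyne = Counter({"C": 2, "H": 2})     # simplest diynophile

# Classic Diels-Alder: every reactant atom ends up in the six-membered ring.
classic_product = combine(butadiene, ethene)  # C6H10 (cyclohexene)

# Hexadehydro variant: same six carbons, but the product is an aryne.
hdda_product = combine(butadiyne, ethyne)     # C6H4 (benzyne)

# The difference in hydrogen count is what 'hexadehydro' refers to.
hydrogens_removed = classic_product["H"] - hdda_product["H"]

print(dict(classic_product), dict(hdda_product), hydrogens_removed)
```

In both cases the product formula is simply the sum of the reactant formulas, which is the sense in which no by-products are formed; the aryne product carries six fewer hydrogen atoms than the classic cyclohexene product.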

Realizing the potential power of this new 
reaction to generate arynes, Hoye et al. 
investigated the scope and limitations of the 
transformation. The reaction tolerates sub- 
strates containing a variety of chemical groups, 
demonstrating the generality of the reagent- 
and by-product-free conditions for aryne 
formation. The authors found that electron- 
withdrawing groups on the diynophile acceler- 
ate the reaction, an effect that is characteristic of 
standard Diels-Alder cycloadditions. In addi- 
tion to silyl ethers, the aryne intermediates can 
react with a variety of other chemical groups, 
such as alcohols (which contain OH groups), 
olefins (which contain carbon-carbon double 
bonds), amides (which contain NH groups), 
bromide ions (Br⁻) and aromatic rings. 

Hoye and colleagues’ reactions are generally 
high yielding, and they provide rapid access 
to complex arenes that could be challeng- 
ing to prepare by more conventional means. 
Although the HDDA reaction has been the 
subject of previous theoretical and experi- 
mental studies, the authors’ report is the 
first to demonstrate practical applications of 
the transiently generated aryne intermediates 
and to thoroughly explore the substrate scope 
of these transformations. 

The diyne and diynophile in the described 
HDDA reactions are contained within the 
same molecule, but one can imagine that a 
reaction between a diyne and diynophile from 
two different molecules would be an even 
more powerful method to construct struc- 
turally complex arenes. The prospect of such 
a reaction, and of myriad new aryne-reaction 
modes that will be enabled by the reagent- 
free conditions, will undoubtedly encourage 
continued interest in and investigation of 
aryne chemistry for years to come. 
Cambrian nervous wrecks 


Fossilized remains of an arthropod from the Cambrian period provide an unusual 
example of preservation of the brain and nervous system, and shed new light on 
when and how these tissues evolved. SEE LETTER P.258 


GRAHAM E. BUDD 


The fossil record can resemble the chaotic attic of an eccen- 
tric relative, stacked with ancient bric-a- 
brac of dubious usefulness. But the record has 
recently been throwing up some surprises that 
are bringing new order to this jumble. Our 
concept of dinosaurs, for example, has evolved 
from what were essentially bolted-together 
lumps of bone into living creatures covered in 
graceful feathers, and in colour too. Other 
fossil finds have brought changes to the scale of 
our understanding of evolution. For example, 
the discovery of exceptionally well-preserved 
fossil muscle fibres throughout the record and 
fossilized embryos from at least the Cambrian 
period, some 500 million years ago, has pro- 
vided remarkable insight into the fine-scale 
evolution of these tissues and life stages. Now, 
on page 258 of this issue, Ma and colleagues 
describe preserved nervous tissue from the 
Cambrian, a find that grants palaeontologists 
access to the exclusive zoological club of those 
who study the brain and nervous system. 

Certain tissue types, such as bone and shell, 
are much more likely to be preserved in the 
fossil record than others, owing to their miner- 
alized and thus relatively refractory nature. Soft 
tissues, such as muscle or even the cuticle of 
arthropods (joint-legged invertebrate animals), 
which is largely proteinaceous and therefore 
liable to decay, are less likely to withstand oxi- 
dation, scavenging, mechanical damage and 
all the other vicissitudes that conspire to make 
the work of a palaeontologist so testing. Tissues 
also preserve differently under different condi- 
tions — ‘bog people’, for example, are often all 
leather and no bones because of the acidity of 
the peat they were preserved in. This variable 
survival is captured by the idea of preserva- 
tion potential, and understanding this concept 
is critical when trying to reconstruct ancient 
organisms, because it places important limita- 
tions on what a difficult-to-interpret structure 
in a fossil might be. It is highly unlikely, for 
example, that delicate blood vessels are going 
to leave a trace in a fossil found in sandstone 
deposited in a high-energy environment. 

The concept of preservation potential has 
recently been receiving empirical support 
from decay experiments — a useful, if rather 
smelly, approach to understanding the fos- 
sil record. These studies have taught us that 
descriptions of fossils that portray kidneys and 
gonads rightfully raise eyebrows. Yet, some- 
what surprisingly, they also suggest that the 
brain and nervous system might have a decent 
chance of preservation, at least in vertebrate 
animals. The situation in invertebrates seems 
less clear, although possible examples have 
been reported. Ma et al., however, have now 
provided convincing evidence for the preser- 
vation of the brain and partial nervous sys- 
tem in fossils of a Cambrian arthropod, and 
one of the most controversial and interesting 
arthropod species to boot, Fuxianhuia. 

The taxonomic position of Fuxianhuia has 
been the topic of some dispute, but it is generally 
considered to lie close to the most recent com- 
mon ancestor of the living arthropods. As such, 
this fossilized material is ideally placed to con- 
tribute to neurophylogeny, the relatively new 
exploration of brain and nervous-system archi- 
tecture in a phylogenetic context. What does 
the most informative of these Fuxianhuia speci- 
mens tell us? Starting at the top, the fossil dis- 
plays two large, faceted eyes on stalks (see Fig. 1 
of the paper). Within the stalks, a dark, iron-rich 
material traces what Ma and colleagues inter- 
pret to be three neuropils (concentrations of 
nervous tissue) from which optic nerves lead 
towards the brain. Here, the preserved material 
is divided into three regions that correspond 
to the classical arthropod brain ganglia of the 
protocerebrum, deutocerebrum and tritocer- 
ebrum (a ganglion is a mass of nerve cells). 
The authors suggest that all three regions lie, at 
least in part, in front of the stomodeum, the 
part of the gut that leads directly to the mouth. 
From the deutocerebrum and tritocerebrum, 
stout nerves lead to the antenna and to what is 
presumably another appendage, respectively. 
This latter structure has proved problematic to 
describe in other Fuxianhuia fossils, and its very 
existence has sometimes been questioned. 

This fossil provides the most convincing, and 
certainly the oldest, description of nervous- 
system tissue in a fossil arthropod. So what do 
these remarkable results imply? The most strik- 
ing feature, as the authors themselves stress, is 
how similar the tripartite brain of Fuxianhuia is 
to that of most modern arthropods, including 
crustaceans and insects. The exception to this 
is the branchiopods (the best-known exam- 
ple of which is Daphnia, the elegant water flea), 
which have a considerably simpler brain, in 
that they lack both complex optic neuropils and 
a prostomodeal tritocerebrum. If Fuxianhuia 
belongs in the stem group of all arthropods, 
then it follows that the organism's complex brain 
organization evolved relatively early within this 
phylum. If this is the case, the authors suggest, 
then the predecessors of extant arthropods with 
less complex brains (which includes not only 
the branchiopods, but also spiders, scorpions 
and their relatives) must have at some stage 
simplified their arrangements. 

However, there are two potential alternatives 
to this far-reaching conclusion. It is possible 
that the arrangement in Fuxianhuia is con- 
vergent to that in the modern crustaceans or 
insects; in other words, similar brain assemblies 
to that reported for Fuxianhuia evolved again 
in later arthropods. Or it may be that we need to 
rethink the systematic position of Fuxianhuia. 
That latter option would entail a substantial 
rearrangement of our present understanding 
of early arthropod evolution — not least in the 
highly vexed issue of the ‘great appendage prob- 
lem’. This refers to the controversial identity 
of a large anterior appendage found in many 
Cambrian arthropods, and seemingly also in 
the Fuxianhuia specimen described here. Dis- 
covering which part of the brain this structure 
is innervated from will add vital information 
to this debate. Either way, Ma and colleagues’ 
findings will prompt hasty re-examination of 
many old specimens, and quite possibly some 
recasting of recent theories. 
ANIMAL MIGRATION 


Catching the wave 


Observations of the migration patterns of Norwegian red deer show that some 
animals ride waves of greener vegetation as spring spreads across the landscape, 
whereas others jump ahead in anticipation of this higher-quality food. 


JOHN M. FRYXELL & TAL AVGAR 


The migration of animals en masse 
provides one of the great mysteries of life. 
Birds do it, bugs do it, even fish in the 
deep blue sea do it. But what is it that compels 
them to pick up and move with monotonous 
regularity, giving up one seemingly good home 
in search of another? Writing in American Natu- 
ralist, Bischof et al. present a field study of red 
deer (Cervus elaphus) (Fig. 1) in Norway that 
offers a tantalizing hint as to the method behind 
this phenomenon. Their observations suggest 
that migration allows deer to take advantage of 
temporally changing feeding conditions in dif- 
ferent parts of the country that are dictated by 
variation in elevation and latitude. 

Such a behavioural strategy would only 
make sense in a world in which growing con- 
ditions are never identical from one place 
to another. As every gardener knows, this is 
indeed the case. For example, the onset of 
the growing season is slower at high latitudes 
than it is nearer the equator, and plants at high 
elevation experience spring much later than 
those at sea level. In a country such as Norway, 
these geographic realities translate into waves 
of vegetation ‘green-up’ that commence at the 
end of winter and gradually spread northwards 
and upwards to higher elevations throughout 
the spring and summer. 

Herbivorous mammals benefit most from 
feeding on younger, rapidly growing shoots 
rather than older plants, which are laden with 
indigestible compounds such as lignin and cel- 
lulose. As a result, animals trying to restore 
weight after an arduous winter, or those shoul- 
dering the demands of lactation, could ben- 
efit by finding the crest of these green waves as 
they move across the landscape. Recent studies 
suggest that some migratory herbivores, such 
as elk in the Rocky Mountains or gazelles in 
Mongolia, obtain better food than their non- 
migratory brethren. But what is not clear from 
previous studies is whether migrant animals 
simply ‘surf’ on the peak of a single green wave, 
continually moving with it over the course of 
the growing season, or if they ‘jump’ between 
peaks (Fig. 2). Because migrations often occur 
over vast spaces, it is difficult to measure the 
success of one movement strategy against 
other options. Bischof and colleagues solved 
this problem using an elegant form of statistical 
book-keeping that allowed them to better 
understand how migrating animals value 
different spatial locations and to assess how 
much of an improvement in food intake is 
achieved by different migration styles. 

To tackle this problem, the researchers 
took advantage of a satellite-derived vegeta- 
tion index termed NDVI, which is a remote- 
sensed indicator of plant abundance. Using 
bi-weekly snapshots of NDVI collected over 
a period of nine years, they estimated the date 
of fastest plant growth at each location. Then, 
for each of 294 study animals, they prepared a 
matrix that linked plant-growth data with the 
temporal sequence of the location of the deer. 
Summed values along the diagonal of each 
matrix provide a measurement of the quality 
of food experienced by a given animal over 
the course of the growing season. The sums of 
other row and column combinations reflect 
the food value that would have been obtained 
with alternative departure dates the deer might 
have taken. If a particular deer is truly surfing 
the green wave, then the sum of food qualities 
along its matrix diagonal should exceed that of 
any other departure schedule. 
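
The book-keeping described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the bell-shaped toy 'green wave' and the rule that a shifted animal simply waits at the first or last site of its route are all hypothetical choices made for the example.

```python
import math

def toy_quality(n, lead, width=1.5):
    """Build quality[t][j]: food quality on date t at the j-th site of a route.

    Assumption for illustration: site j reaches peak growth on date j + lead,
    and quality falls off as a bell curve around that date.
    """
    return [[math.exp(-((t - (j + lead)) ** 2) / (2 * width ** 2))
             for j in range(n)] for t in range(n)]

def schedule_score(quality, shift):
    """Summed quality if the same route is followed `shift` dates later.

    A negative shift means an earlier departure. shift == 0 is the matrix
    diagonal, that is, the schedule the animal actually followed.
    """
    n = len(quality)
    total = 0.0
    for t in range(n):
        j = min(max(t - shift, 0), n - 1)  # wait at the first or last site
        total += quality[t][j]
    return total

def best_shift(quality, max_shift=3):
    """Return the departure shift with the highest summed food quality."""
    shifts = range(-max_shift, max_shift + 1)
    return max(shifts, key=lambda k: schedule_score(quality, k))

surfer = toy_quality(6, lead=0)  # animal reaches each site on its peak date
jumper = toy_quality(6, lead=2)  # animal reaches each site two dates early

print(best_shift(surfer))  # 0
print(best_shift(jumper))  # 2
```

In this toy setting the surfer cannot improve on its own diagonal, whereas the jumper would have experienced better food by leaving two dates later, which is the kind of contrast the diagonal test is designed to reveal.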

Bischof and colleagues differentiated 
migratory from resident individuals. Their 
results demonstrate that migratory individu- 
als perform better than resident individuals, 
who in turn perform better than hypotheti- 
cal migratory animals would if they had lin- 
gered in their winter home ranges rather than 
migrating. The authors’ comparisons of deer 
from different regions of Norway suggest that 
the proportion of migratory individuals scales 


Figure 1 | Greener pastures. Some red deer (Cervus elaphus) migrate each year to take advantage of new plant growth that emerges with the arrival of spring. 
Figure 2 | Surfers and jumpers. Norwegian red 
deer change their grazing sites in response to the 
waves of vegetation greening that start at lower, 
more southerly locations at the end of winter (a) 
and gradually spread northwards and to higher 
elevations throughout the spring and summer. 
b,c, As a green wave travels across the landscape, 
resident (R) animals lag behind, whereas there are 
two possible movement strategies for migrating 
animals: ‘surfers’ (S) travel with the wave, whereas 
‘jumpers’ (J) move ahead in anticipation of 
where the wave will be at a later date. Bischof and 
colleagues’ combined analysis of satellite data on 
vegetation abundance and animal tracking shows 
that most deer use the ‘jumper’ strategy. 


with the potential gains that are obtainable. 
Surprisingly, however, they found that most 
migrant animals do not surf the green wave 
but rather jump it, arriving in their summer 
range well before the arrival of optimal feed- 
ing conditions. Furthermore, it seems that 
some animals leap from peak to peak over the 
course of the season. This suggests that many 
deer do not follow the optimal surfing strategy, 
but nonetheless benefit to a lesser degree by 
periodically jumping into greener pastures. 
Why they do so remains an open question — 
perhaps their jumping strategy stems from the 
need to balance feeding with physiological 
costs, predation risk or other constraints that 
might affect their survival during migration. 

As with all new approaches, there are 
questionable assumptions in this study. For 
instance, one would like to have more evidence 
that the date of fastest plant growth actually 
translates into maximum food quality. Also, 
there is some uncertainty surrounding the 
authors’ classifications, because distinguish- 
ing between migrants and residents is often 
troublesome in species that occupy large home 
ranges, particularly when movements occur as 
a series of jumps rather than a smooth progres- 
sion. Perhaps most important is the issue that, 
although the authors’ method allows rigorous 
assessment of the ecological consequences of 
different patterns of movement timing along 
a single trajectory, it cannot tell us the relative 
value of a particular spatial trajectory against 
the infinite number of other conceivable pat- 
terns. This challenge invokes the well-known 
combinatorial problem referred to as the 
travelling salesman dilemma. Although such 
complex spatial problems currently cannot be 
solved exactly, new statistical modelling tech- 
niques based on likelihood assessments could 
help to provide robust estimates. 

These complications aside, Bischof and 
colleagues’ study offers a fresh take on move- 
ment ecology, a burgeoning field that blends 
developments in movement modelling with 
technological advances made available by 


EARTH SCIENCE 


NEWS & VIEWS | RESEARCH | 


satellite-based information systems*. The logic 
used by the authors should, in principle, be 
applicable to a broad range of animal move- 
ment patterns, ranging from nomadism to ter- 
ritoriality. Perhaps the day is not far off when the 
complex pattern of steps taken by an individual 
over its entire life can be predicted on the basis 
of the continually shifting mosaic of resources 
and costs with which it is confronted. = 
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When an oceanic 
tectonic plate cracks 


Analyses of two recent earthquakes of great magnitude show how complex the 
breaking of the oceanic lithosphere can be, how it is linked to earlier great events 
and how it triggers seismicity worldwide. SEE LETTERS P.240, P.245 & P.250 


JEAN-YVES ROYER 


On 11 April 2012, two great earthquakes of magnitude (Mw) 8.6 and 8.2 struck
the northeastern Indian Ocean, a few hundred kilometres off the Sunda Trench
that lies just west of Indonesia (Fig. 1). The two earthquakes were close in
time and space (2 hours and 185 km apart), and occurred not far from the
epicentres of the devastating megathrust earthquakes that hit Sumatra’s Aceh
region in 2004 (Mw 9.2) and Nias Island in 2005 (Mw 8.7). Fortunately, the 2012
intra-oceanic events were not damaging to human activities, but they will be
marked in the annals of seismology for several reasons: first, because of their
unusually high magnitudes given that they occurred in the interior of an
oceanic tectonic plate, rather than at a plate boundary; second, because the
‘strike-slip’ faulting mechanism of the events is unexpected for earthquakes of
such magnitude; and finally, because of the complexity of the rupture of the
intervening plate. These exceptional properties put the Mw-8.6 mainshock into
the top-ten list of the world’s largest earthquakes since 1900, and promptly
aroused the curiosity of seismologists and those working on plate tectonics,
because the events took place in one of the largest and most complex deforming
zones in the oceanic lithosphere (Earth’s crust and uppermost mantle). Three
papers in this issue investigate these unprecedented events.

Yue et al.* (page 245) unravel the extraordi- 
nary complexity of the underlying mechanisms 
of the 2012 events, which display perpendicu- 
lar (orthogonal) and discontinuous (en éch- 
elon) fault ruptures. By analysing short-period 
and long-period seismic waves recorded by 
stations in Europe and Japan, the authors iden- 
tified a sequence of shocks along four fault 
planes, and estimated the rupture length and 
amount of slip along the faults. The deduced 
radiation pattern of the seismic waves outlines 
the interplay between multiple orthogonal and 
en échelon faults, whose spatial distribution is 
consistent with the epicentres 
of the first-week aftershocks. 
The orientations of the faults 
reflect the seafloor’s tectonic 
fabric””, with ruptures occur- 
ring alternately along faults 
that extend bilaterally west- 
northwest and east-southeast 
(parallel to the abyssal hills) or 
along faults extending north- 
northeast and south-south- 
west (parallel to the fracture 
zone’s orientation), the latter 
being perpendicular to the 
former. The high-level seismicity
in the Wharton Basin 
(Fig. 1), as well as seafloor 
mapping, had already pro- 
vided direct evidence that the 
fossil fracture zones — seismi- 
cally inactive, long linear faults 
generated by seafloor spread- 
ing — in this region are con- 
tinually reactivated®’. But the 
question remains as to why 
so much stress can accumu- 
late and provoke such large- 
magnitude earthquakes, and 
furthermore in an area that 
could have been activated by 
the nearby Aceh megathrust 
earthquake. 

This is exactly what 
Delescluse et al.’ (page 240) 
attempt to answer, by inves- 
tigating the causal relation- 
ship between the high-level 
seismicity along the Sunda 
Trench (Fig. 1) — in par- 
ticular the Aceh and Nias 
megathrust events — and 
the off-trench intra-oceanic 
seismicity. From Deles- 
cluse and colleagues’ Fig- 
ure 2 (page 241), there is 
no doubt that the Aceh and 
Nias events have shaken the 
Indo-Australian lithosphere 
which is plunging (subduct- 
ing) beneath the Sunda plate, 
and that they triggered a 
long-lasting, large increase in seismic activ- 
ity off-trench. By looking at how stresses are 
released in the vicinity of large ruptures (Cou- 
lomb stress changes), the authors show that co- 
seismic slips of the Aceh and Nias megathrusts 
can promote left-lateral strike-slip earthquakes 
with a direction parallel to the orientation of 
the Wharton Basin fracture zones, similar to 
the mechanisms observed in the April 2012 
events. However, such an approach assumes 
an elastic rheology, for which stress changes 
decrease immediately after the earthquake, as 
a new interseismic stress-loading cycle starts 
along ruptured faults. Delescluse et al. explain
the seven-year lag between the Aceh and
April 2012 events by a buffering effect of the
viscoelastic properties of the asthenosphere,
the region of the mantle that underlies the
lithosphere. The authors’ model predicts that
the maximum post-seismic stresses generated
by the Aceh megathrust could have reached
the area of the April 2012 earthquakes seven
to ten years later.

Figure 1 | Seismo-tectonic chart of the India–Australia plate. Earthquakes in this
region have either strike-slip faulting mechanisms (green dots and stars) with faults
oriented parallel to fracture zones, or thrust faulting mechanisms (red dots and stars)
with faults generally perpendicular to the fracture zones (refs 6, 13). Two large earthquakes
(green stars) that occurred on 11 April 2012 have now been investigated; aftershocks
of those earthquakes are shown in white circles. The magnitudes (specifically the
‘moment’ magnitudes, Mw) of earthquakes equal to or greater than 6.9 are indicated.
Seismic events related to the subduction zone (east of the Sunda Trench, which is
depicted as a white, broken line) are not shown, except for the megathrust earthquakes
(red stars) at Aceh in 2004 (Mw 9.2) and at Nias in 2005 (Mw 8.7). Events with unknown
fault mechanisms within the India–Australia plate boundary are also not shown. Black
dashed lines define the limits of the active boundary between the Indian and Australian
plates. Gravity anomalies (measured in mgal) reflect the ocean basin’s topography
beneath sediments, and are indicated by colours on the map: troughs are associated with
negative anomalies, and ridges with positive ones. There is a close relationship between
the distribution of earthquakes, particularly those that occurred on 11 April 2012, and
the extinct seafloor fabric of the eastern Indian Ocean, such as the long north-south
oriented fracture zones (lineated troughs) in the Wharton Basin.

In turn, the two April 2012 events may have
triggered seismicity worldwide. Pollitz et al.
(page 250) show that the number of remote
earthquakes with magnitudes larger than 5.5
and up to 7 increased worldwide nearly fivefold
in the six days following the April 2012
events. The authors attribute this effect to the
powerful radiation pattern of 
surface seismic waves gener- 
ated by the strike-slip mecha- 
nisms, but cannot exclude the 
possibility that the 2012 earth- 
quakes struck during a par- 
ticularly quiet seismic period, 
and so triggered an unusually 
large number of earthquakes 
that were already on the brink 
of occurring. 

Interestingly, the papers by 
Delescluse et al. and by Pollitz 
et al. reveal contrasting short- 
term and long-term ‘domino 
effects’ of major seismic events: 
the April 2012 events may have 
triggered quasi-immediate 
aftershocks far away, whereas 
the 2004 and 2005 events pos- 
sibly triggered aftershocks 
nearby several years later. The 
two studies emphasize how 
the rupture mechanism and 
the tectonic setting of earth- 
quakes govern their ability to 
trigger further seismic activity 
through seismic-wave excitation
and short-term or long-term
stress relaxation. 

The occurrence off-trench 
of such major events is per- 
haps better understood in 
the context of the high lith- 
ospheric stresses caused by 
differential motion between 
the Indian plate, which is 
slowed down in its northward 
drift by collision with the 
Eurasian plate, and the Aus- 
tralian plate, which is rapidly 
subducting underneath Indo- 
nesia along the Sunda Trench. 
As a result, the two plates 
move towards each other at a 
rate of 15 millimetres per year 
in an east-southeast direc- 
tion’®, and have been doing so 
for at least 10 million years". 
The April 2012 events have 
the largest magnitudes ever 
recorded in the diffuse (as opposed to nar- 
row) India—Australia plate boundary, where 
earthquakes of magnitude 7 and greater are 
not uncommon (Fig. 1). 

Do these major intraplate earthquakes indi- 
cate the birth of a new, discrete plate bound- 
ary? Probably not, because this unusually 
high-level seismicity occurs over large areas 
(diffuse boundaries) bounded by otherwise 
stable and ‘quiet’ oceanic plate interiors (rigid 
plates), with all these pieces constituting the 
composite, deforming Indo-Australian lithosphere.
The April 2012 events illustrate how
oceanic lithosphere can fail in complex ways
as a result of the fabric structure it has inherited
from seafloor spreading, and they shed new
light on its mechanical behaviour and strength,
and on earthquake physics.

The neuron family 
tree remodelled 


The discovery of different classes of neuronal progenitor cell, destined to give 
rise to neurons in specific layers of the cerebral cortex, could presage the revision 
of a 50-year-old model of brain development. 


OSCAR MARIN 


The neocortex, the mammalian brain’s most recent evolutionary acquisition,
controls the aspects of behaviour that make us human, from the fine-scale finger
movements of a virtuoso pianist to the complex syntactic processing required for
language. Its highly regular structure contains a complex matrix of excitatory
and inhibitory neurons organized into distinct layers and columns. Neurons in
any one cortical layer share general patterns of connectivity, whereas neurons
in the same column are typically interconnected across layers and function as
the basic unit of cortical operations. Over the past 100 years, analysis of the
developing brain has provided fundamental insights into the functional
organization of the neocortex. However, Franco et al., writing in Science,
provide evidence that puts a radically new perspective on the link between
brain development and neuronal function.

A fascinating aspect of cortical development 
is that excitatory neurons (also known as 
pyramidal cells) are ‘born’ in sequential order, 
with those located in deep layers of the neocor- 
tex being generated first and subsequently 
generated neurons being positioned in pro- 
gressively more superficial layers. It is usually 
assumed that pyramidal cells in all layers of the 
neocortex originate from a single type of pro- 
genitor cell, and that a progenitor’s ability to 
generate distinct classes of neuron decreases 
with time. That is, progenitors during early 
embryonic development can give rise to any 
class of pyramidal cell (they are ‘multipotent’) 
but are progressively restricted to producing 
superficial neurons later on. New neurons, 
originating from transient amplifying cells that 
become detached from the ventricular zone 
(the inner layer of the cortex, where progeni- 
tors reside), migrate to their final position by 
following the radial fibres of progenitor cells 
(Fig. 1a). These fibres serve as a scaffold that 
links the ventricular zone and the cortical sur- 
face, just like spokes connecting the centre of 
a wheel to its outer edge. 

According to this classical view, the ‘birth 
date’ of a neuron largely determines its fate, 
and sibling neurons are vertically aligned 
through the radial axis of the neocortex. This 
model has been highly influential, because it 
relates the concept of ontogenetic columns 
(vertical arrangements of sibling neurons born 
from a common progenitor) with the notion of 
functional columns — the long-sought funda- 
mental unit of cortical computation. Indeed, 
recent studies suggest that sibling neurons are 
more likely to respond to the same sensory 
stimuli than a random subset of neighbouring 
neurons”. 

Franco et al. report that the gene-regulatory 
protein Cux2 is expressed by a small subset 
of progenitor cells in the developing cortex 
of mice, and that this subset increases over 
time. These findings are unexpected, because 
Cux2 expression was previously thought to be 
restricted to pyramidal cells in the superficial 
layers of the cortex. The authors then hypothe- 
sized that the Cux2-expressing progenitor cells 
might be fated to generate superficial pyrami- 
dal cells. To test this possibility, they generated 
and studied a mouse strain in which a fluores- 
cent protein was produced exclusively in cells 


50 Years Ago 


It is now fifty years since, with the 
creation of the former Medical 
Research Committee, Great Britain 
took the lead in setting up central 
organizations for research ... 

The Medical Research Council 
conceives its function as: to watch 
over the whole fields of medical 
and related biological research 

so as to foresee, to the best of its 
ability, the needs and opportunities 
to give support to any promising 
research in these fields irrespective 
of the agent concerned; to work in 
partnership with the universities 
and professions on one hand 

and the various Government 
departments on the other, so that 
new knowledge may be made 
available as the need arises. 

From Nature 13 October 1962 


100 Years Ago 


Coming out from Queenstown 

on September 10 on her way 
toward Boston, the ss. Arabic was 
accompanied for some hours 

by a large flock of gulls. For the 
most part these birds were visibly 
working, flapping their wings, but 
occasionally a few would cease 
flapping and merely sail along 

for considerable distances ... An 
explanation of the ability of the 
birds to sail, under the conditions 
described above, is, I believe, found 
in the upward course of the wind 
which has struck the weather side 
of the ship and must rise in order to 
pass over it ... As the trail of smoke 
marks the direction of the wind 
with respect to the moving ship, 
the bird must, in order to sail with 
the same velocity and direction as 
the ship, have a motion relative to 
the air equal and opposite to the 
motion of the smoke relative to the 
ship. Accordingly, the bird’s axis is 
kept parallel to, and opposite to, the 
course of the smoke as indicated by 
its train from the funnel. 

From Nature 10 October 1912 


PERSPECTIVE 


doi:10.1038/nature11556 


A call for transparent reporting to optimize the predictive value of preclinical research 


Story C. Landis, Susan G. Amara, Khusru Asadullah, Chris P. Austin, Robi Blumenstein, Eileen W. Bradley, Ronald G. Crystal, 
Robert B. Darnell, Robert J. Ferrante, Howard Fillit, Robert Finkelstein, Marc Fisher, Howard E. Gendelman, 
Robert M. Golub, John L. Goudreau, Robert A. Gross, Amelie K. Gubitz, Sharon E. Hesterlee, David W. Howells, 
John Huguenard, Katrina Kelner, Walter Koroshetz, Dimitri Krainc, Stanley E. Lazic, Michael S. Levine, 
Malcolm R. Macleod, John M. McCall, Richard T. Moxley III, Kalyani Narasimhan, Linda J. Noble, Steve Perrin, 
John D. Porter, Oswald Steward, Ellis Unger, Ursula Utz & Shai D. Silberberg 


The US National Institute of Neurological Disorders and Stroke convened major stakeholders in June 2012 to discuss 
how to improve the methodological reporting of animal studies in grant applications and publications. The main 
workshop recommendation is that, at a minimum, studies should report on sample-size estimation, whether and how 
animals were randomized, whether investigators were blind to the treatment, and the handling of data. We recognize 
that achieving a meaningful improvement in the quality of reporting will require a concerted effort by investigators, 
reviewers, funding agencies and journal editors. Requiring better reporting of animal studies will raise awareness of the 
importance of rigorous study design to accelerate scientific progress. 


Dissemination of knowledge is the engine that drives scientific 
progress. Because advances hinge primarily on previous observations, 
it is essential that studies are reported in sufficient detail 
to allow the scientific community, research funding agencies and disease 
advocacy organizations to evaluate the reliability of previous findings. 
Numerous publications have called attention to the lack of transparency 
in reporting, yet studies in the life sciences in general, and in animals in 
particular, still often lack adequate reporting on the design, conduct and 
analysis of the experiments. To develop a plan for addressing this critical 
issue, the US National Institute of Neurological Disorders and Stroke 
(NINDS) convened academic researchers and educators, reviewers, 
journal editors and representatives from funding agencies, disease advo- 
cacy communities and the pharmaceutical industry to discuss the causes 
of deficient reporting and how they can be addressed. The specific goal 
of the meeting was to develop recommendations for improving how the 
results of animal research are reported in manuscripts and grant appli- 
cations. There was broad agreement that: (1) poor reporting, often 
associated with poor experimental design, is a significant issue across 
the life sciences; (2) a core set of research parameters exists that should be 
addressed when reporting the results of animal experiments; and (3) a 
concerted effort by all stakeholders, including funding agencies and 
journals, will be necessary to disseminate and implement best reporting 
practices throughout the research community. Here we describe the 
impetus for the meeting and the specific recommendations that were 
generated. 


Widespread deficiencies in methods reporting 


In the life sciences, animals are used to elucidate normal biology, to 
improve understanding of disease pathogenesis, and to develop therapeutic 
interventions. Animal models are valuable, provided that experiments 
employing them are carefully designed, interpreted and reported. Several 
recent articles, commentaries and editorials highlight that inadequate 
experimental reporting can result in such studies being un-interpretable 
and difficult to reproduce’. For instance, replication of spinal cord injury 
studies through an NINDS-funded program determined that many studies 
could not be replicated because of incomplete or inaccurate description 
of experimental design, especially how randomization of animals to the 
various test groups, group formulation and delineation of animal attri- 
tion and exclusion were addressed’. A review of 100 articles published in 
Cancer Research in 2010 revealed that only 28% of papers reported that 
animals were randomly allocated to treatment groups, just 2% of papers 
reported that observers were blinded to treatment, and none stated the 
methods used to determine the number of animals per group, a deter- 
mination required to avoid false outcomes”. In addition, analysis of several 
hundred studies conducted in animal models of stroke, Parkinson’s disease 
and multiple sclerosis also revealed deficiencies in reporting key methodo- 
logical parameters that can introduce bias®. Similarly, a review of 76 high- 
impact (cited more than 500 times) animal studies showed that the pub- 
lications lacked descriptions of crucial methodological information that 
would allow informed judgment about the findings’. These deficiencies in 
the reporting of animal study design, which are clearly widespread, raise 
the concern that the reviewers of these studies could not adequately identify 
potential limitations in the experimental design and/or data analysis, lim- 
iting the benefit of the findings. 

Some poorly reported studies may in fact be well-designed and well- 
conducted, but analysis suggests that inadequate reporting correlates 
with overstated findings'®’. Problems related to inadequate study 
design surfaced early in the stroke research community, as investigators 
tried to understand why multiple clinical trials based on positive results 
in animal studies ultimately failed. Part of the problem is, of course, that 
no animal model can fully reproduce all the features of human stroke. 
It also became clear, however, that many of the difficulties stemmed 
from a lack of methodological rigor in the preclinical studies that were 
not adequately reported’’. For instance, a systematic review and meta- 
analysis of studies testing the efficacy of the free-radical scavenger NXY- 
059 in models of ischaemic stroke revealed that publications that 
included information on randomization, concealment of group alloca- 
tion, or blinded assessment of outcomes reported significantly smaller 
effect sizes of NXY-059 in comparison to studies lacking this informa- 
tion’. In certain cases, a series of poorly designed studies, obscured by 
deficient reporting, may, in aggregate, serve erroneously as the scientific 
rationale for large, expensive and ultimately unsuccessful clinical trials. 
Such trials may unnecessarily expose patients to potentially harmful 
agents, prevent these patients from participating in other trials of possibly 
effective agents, and drain valuable resources and energy that might 
otherwise be more productively spent. 


A core set of reporting standards 


The large fraction of poorly reported animal studies and the empirical 
evidence of associated bias®!°"*'*°, defined broadly as the introduction 
of an unintentional difference between comparison groups, led various 
disease communities to adopt general?” and animal-model-specific®** 7° 
reporting guidelines. However, for guidelines to be effective and broadly 
accepted by all stakeholders, they should be universal and focus on widely 
accepted core issues that are important for study evaluation. Therefore, 
based on available data, we recommend that, at minimum, authors of 
grant applications and scientific publications should report on randomi- 
zation, blinding, sample-size estimation and the handling of all data (see 
below and Box 1). 


Randomization and blinding 


Choices made by investigators during the design, conduct and inter- 
pretation of experiments can introduce bias, resulting in false-positive 
results. Many have emphasized the importance of randomization and 
blinding as a means to reduce bias®*”'~**”’, yet inadequate reporting of 
these aspects of study design remains widespread in preclinical research. 
It is important to report whether the allocation, treatment and handling 
of animals were the same across study groups. The selection and source 
of control animals need to be reported as well, including whether they 
are true littermates of the test groups. Best practices should also include 
reporting on the methods of animal randomization to the various 
experimental groups, as well as on random (or appropriately blocked) 
sample processing and collection of data. Attention to these details will 
avoid mistaking batch effects for treatment effects (for example, dividing 
samples from a large study into multiple lots, which are then processed 
separately). Investigators should also report on whether the individuals 
caring for the animals and conducting the experiments were blinded to 
the allocation sequence, blinded to group allocation and, whenever 
possible, whether the persons assessing, measuring or quantifying the 
experimental outcomes were blinded to the intervention. 
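
As an illustration of the kind of procedure that could be reported, the following is a minimal sketch of blocked random allocation followed by coding of group labels for blinded assessment. It is not taken from the workshop recommendations: the function names, the seed values, the group labels and the block design are all assumptions chosen for the example.

```python
import random


def allocate_blocked(animal_ids, groups, seed):
    """Randomly assign animals to groups in balanced blocks.

    Hypothetical helper for illustration only. Each block holds one
    animal per group, so group sizes stay balanced when the number of
    animals is a multiple of the number of groups.
    """
    rng = random.Random(seed)      # a recorded seed makes the allocation reproducible
    ids = list(animal_ids)
    rng.shuffle(ids)               # random order in which animals enter blocks
    allocation = {}
    block_size = len(groups)
    for start in range(0, len(ids), block_size):
        block = ids[start:start + block_size]
        labels = list(groups)
        rng.shuffle(labels)        # random group order within each block
        for animal, label in zip(block, labels):
            allocation[animal] = label
    return allocation


def blind_codes(allocation, seed):
    """Replace group names with opaque codes for blinded assessment."""
    rng = random.Random(seed)
    names = sorted(set(allocation.values()))
    codes = [f"code_{i}" for i in range(len(names))]
    rng.shuffle(codes)
    key = dict(zip(names, codes))  # to be held by someone who does not assess outcomes
    blinded = {animal: key[group] for animal, group in allocation.items()}
    return blinded, key


allocation = allocate_blocked(range(1, 13), ["vehicle", "treatment"], seed=2012)
blinded, key = blind_codes(allocation, seed=7)
```

A methods section could then state the block size, the software and seed used to generate the sequence, and who held the key until the outcomes had been scored.
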

BOX 1 
A core set of reporting standards 
for rigorous study design 


Randomization 
• Animals should be assigned randomly to the various experimental 
groups, and the method of randomization reported. 
• Data should be collected and processed randomly or appropriately 
blocked. 
Blinding 
• Allocation concealment: the investigator should be unaware of the 
group to which the next animal taken from a cage will be allocated. 
• Blinded conduct of the experiment: animal caretakers and 
investigators conducting the experiments should be blinded to the 
allocation sequence. 
• Blinded assessment of outcome: investigators assessing, 
measuring or quantifying experimental outcomes should be 
blinded to the intervention. 
Sample-size estimation 
• An appropriate sample size should be computed when the study is 
being designed and the statistical method of computation reported. 
• Statistical methods that take into account multiple evaluations of 
the data should be used when an interim evaluation is carried out. 
Data handling 
• Rules for stopping data collection should be defined in advance. 
• Criteria for inclusion and exclusion of data should be established 
prospectively. 
• How outliers will be defined and handled should be decided when 
the experiment is being designed, and any data removed before 
analysis should be reported. 
• The primary end point should be prospectively selected. If multiple 
end points are to be assessed, then appropriate statistical 
corrections should be applied. 
• Investigators should report on data missing because of attrition or 
exclusion. 
• Pseudo replicate issues need to be considered during study design 
and analysis. 
• Investigators should report how often a particular experiment was 
performed and whether results were substantiated by repetition 
under a range of conditions. 


Sample-size estimation 

Minimizing the use of animals in research is not only a requirement of 
funding agencies around the world but also an ethical obligation. It is 
unethical, however, to perform underpowered experiments with insuf- 
ficient numbers of animals that have little prospect of detecting meaningful 
differences between groups. In addition, with smaller studies, the positive 
predictive value is lower, and false-positive results can ensue, leading to the 
needless use of animals in subsequent studies that build upon the incorrect 
results**. Studies with an inadequate sample size may also provide false- 
negative results, where potentially important findings go undetected. For 
these reasons it is crucial to report how many animals were used per group 
and what statistical methods were used to determine this number. 
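
To make the idea of a reported sample-size calculation concrete, here is a minimal sketch for a two-group comparison of means. It assumes a two-sided test, equal group sizes and the standard normal approximation n = 2(z_(1-alpha/2) + z_power)^2 / d^2, where d is the standardized difference between groups. The function name and default values are illustrative assumptions, and the approximation gives slightly smaller numbers than an exact t-test calculation.

```python
import math
from statistics import NormalDist


def animals_per_group(effect_size, alpha=0.05, power=0.80):
    """Approximate animals needed per group to compare two group means.

    effect_size is the standardized difference (difference in means
    divided by the common standard deviation). Normal approximation,
    two-sided test, equal group sizes: all assumptions of this sketch.
    """
    if effect_size <= 0:
        raise ValueError("effect_size must be positive")
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # critical value for the test
    z_power = NormalDist().inv_cdf(power)          # quantile for the desired power
    n = 2 * ((z_alpha + z_power) / effect_size) ** 2
    return math.ceil(n)                            # round up to whole animals


n_medium = animals_per_group(0.5)   # a moderate standardized effect
n_large = animals_per_group(1.0)    # a large standardized effect
```

Under these assumptions, halving the expected effect size roughly quadruples the number of animals required, which is why the assumed effect size and variability should be reported together with the resulting group size.
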


Data handling 
Common practices related to data handling that can also lead to false 
positives include interim data analysis”, the ad hoc exclusion of data*’, 
retrospective primary end point selection*', pseudo replication” and 
small effect sizes*’. 


Interim data analysis 

It is not uncommon for investigators to collect some data and perform 
an interim data analysis. If the results are statistically significant in 
favour of the working hypothesis, the study is terminated and a paper 
is written. If the results look ‘promising’ but are not statistically significant, 
additional data are collected. This has been referred to as ‘sampling 
to a foregone conclusion’ and can lead to a high rate of false-positive 
findings””°. Therefore, sample size and rules for stopping data collec- 
tion should be defined in advance and properly reported. Unplanned 
interim analyses, which can inflate false-positive outcomes and require 
unblinding of the allocation code, should be avoided. If there are interim 
analyses, however, these should be reported in the publication. 
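
The inflation caused by unplanned interim looks can be illustrated with a small simulation. The sketch below is an assumption-laden toy, not an analysis from the cited literature: both groups are drawn from the same normal distribution, so every 'significant' result is a false positive; a z-test with known standard deviation is used for simplicity; and the look schedule, seed and number of simulated experiments are arbitrary choices.

```python
import math
import random


def false_positive_rate(n_sims, looks, seed, z_crit=1.96):
    """Fraction of null experiments declared significant at any look.

    looks lists the per-group sample sizes at which the data are tested.
    Both groups come from N(0, 1), so any rejection is a false positive.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_sims):
        sum_a = sum_b = 0.0
        n = 0
        for target in looks:
            while n < target:                  # collect data up to the next look
                sum_a += rng.gauss(0.0, 1.0)
                sum_b += rng.gauss(0.0, 1.0)
                n += 1
            z = (sum_a / n - sum_b / n) / math.sqrt(2.0 / n)
            if abs(z) > z_crit:                # 'significant': stop collecting data
                hits += 1
                break
    return hits / n_sims


single_look = false_positive_rate(4000, [50], seed=1)
five_looks = false_positive_rate(4000, [10, 20, 30, 40, 50], seed=1)
```

In this toy setting, testing once at the planned sample size keeps the false-positive rate near the nominal 5%, whereas testing at each of five looks and stopping at the first significant result pushes it well above that level.
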


Ad hoc exclusion of data 

Animal studies are often complex and outliers are not unusual. 
Decisions to include or exclude specific animals on the basis of outcomes 
(for example, state of health, dissimilarity to other data) have the poten- 
tial to influence the study results. Thus, rules for inclusion and exclusion 
of data should be defined prospectively and reported. It is also important 
to report whether all animals that were entered into the experiment 
actually completed it, or whether they were removed, and if so, for what 
reason. Differential attrition between groups can introduce bias. For 
example, a treatment may appear effective if it kills off the weakest or 
most severely affected animals whose fates are then not reported. In 
addition, it is important to report whether any data were removed before 
analysis and the reasons for this data exclusion. 


Retrospective primary end-point selection 

It is well known that assessment of multiple end points, and/or assess- 
ment of a single end point at multiple time points, inflates the type-I 
error (false-positive results)*". Yet it is not uncommon for investigators 
to select a primary end point only after data analyses. False-positive 
conclusions arising from such practices can be avoided by specifying a 
primary end point before the study is undertaken, the time(s) at which 
the end point will be assessed, and the method(s) of analysis. Significant 
findings for secondary end points can and should be reported, but 
should be delineated as exploratory in nature. If multiple end points 
are to be assessed, then appropriate statistical corrections should be 
applied to control type-I error, such as Bonferroni corrections*'™’. 
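
For readers unfamiliar with the Bonferroni correction mentioned above, the following minimal sketch shows the arithmetic: with m end points, each P value is compared with alpha / m, or equivalently multiplied by m and capped at 1. The function names and the example P values are hypothetical.

```python
def bonferroni_significant(p_values, alpha=0.05):
    """Return True for each P value that survives Bonferroni correction."""
    threshold = alpha / len(p_values)          # per-test significance threshold
    return [p <= threshold for p in p_values]


def bonferroni_adjust(p_values):
    """Return Bonferroni-adjusted P values, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


# Hypothetical P values for three end points assessed in one study
decisions = bonferroni_significant([0.01, 0.04, 0.03])
adjusted = bonferroni_adjust([0.5, 0.6])
```

With three end points, only P values at or below 0.05 / 3 remain significant, so in this example the first end point survives correction and the other two do not.
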


Pseudo replicates 

When considering sample-size determination and experimental design, 
pseudo-replication issues need to be considered”. There is a clear, but 
often misunderstood or misrepresented, distinction between technical 
and biologic replicates. For example, in analysing effects of pollutants on 
reproductive health, multiple sampling from a litter, regardless of 
how many littermates are quantified, provides data from only a single 
biologic replicate. When biologic variation in response to some interven- 
tion is the variable of interest, as in many animal experiments, analysis of 
samples from multiple litters is essential. The unit of assessment is the 
smallest unit (animal, cage, litter) to which the intervention in question 
can be independently administered”. 
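
One simple way to respect the unit of assessment is to collapse measurements to a single value per unit before analysis. The sketch below assumes, for illustration, that the litter is the unit to which the intervention was applied; the data values, litter labels and function name are invented for the example.

```python
from collections import defaultdict
from statistics import mean


def litter_means(measurements):
    """Collapse pup-level values to one mean value per litter.

    measurements is an iterable of (litter_id, value) pairs. The litter
    is assumed to be the experimental unit in this sketch.
    """
    by_litter = defaultdict(list)
    for litter_id, value in measurements:
        by_litter[litter_id].append(value)
    return {litter: mean(values) for litter, values in by_litter.items()}


# Six pups measured, but only three litters (hypothetical data)
pups = [("L1", 10.0), ("L1", 12.0), ("L1", 11.0),
        ("L2", 20.0), ("L2", 22.0), ("L3", 15.0)]
per_litter = litter_means(pups)
n_biological = len(per_litter)   # the sample size that enters the analysis
```

Here the analysis would proceed with three biological replicates, not six, however many pups were measured per litter. Mixed-effects models are an alternative that keeps the pup-level data while accounting for litter.
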


Small effect sizes 

A statistically significant result does not provide information on the 
magnitude of the effect and thus does not necessarily mean that the 
effect is robust, which could account for the poor reproducibility of certain 
studies. Therefore, authors are encouraged to report whether results were substantiated by 
repetition, preferably under a range of conditions that demonstrate the robustness 
of the effect. Reporting how often a particular experiment was performed, as a means 
to control for a general tendency to publish only the best results, would also 
strengthen the validity of experimental results. 
To this end, carefully designed and powered animal studies should 
be budgeted for in the grant applications and funding agencies should 
consider supporting repetition studies where appropriate. 
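
Reporting a standardized effect size alongside the P value is one common way to convey the magnitude of an effect. The sketch below computes Cohen's d with a pooled standard deviation; this particular measure is a choice made for the example, not something the recommendations prescribe, and the data are invented.

```python
import math
from statistics import mean, variance


def cohens_d(group_a, group_b):
    """Standardized mean difference using the pooled standard deviation."""
    n_a, n_b = len(group_a), len(group_b)
    pooled_var = ((n_a - 1) * variance(group_a) +
                  (n_b - 1) * variance(group_b)) / (n_a + n_b - 2)
    return (mean(group_a) - mean(group_b)) / math.sqrt(pooled_var)


# Hypothetical outcome scores for treated and control animals
d = cohens_d([2.0, 4.0, 6.0], [1.0, 3.0, 5.0])
```

In this example the group means differ by 1 and the pooled standard deviation is 2, giving d = 0.5. With only three animals per group such an estimate would be very imprecise, so a confidence interval should accompany it.
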


An important note about exploratory experiments 

For the most part, these best practices do not apply to early-stage obser- 
vational experiments searching for possible differences among experi- 
mental groups. Such exploratory testing is frequently conducted using a 
small sample size, does not have a primary outcome and is often 
unblinded. However, because such experiments are likely to be subject 
to many of the limitations described above, they should be viewed as 
hypothesis-generating experiments and interpreted as such. Potential 
discoveries arising from the exploratory phase of the research should be 
supported by follow-up, hypothesis-testing experiments that take into 
consideration and adequately report on the core standards detailed 
above (Box 1). 


The path to implementation 


Improving the transparency and quality of reporting cannot be achieved 
by a single party, but will require cooperation among all stakeholders, 
including investigators, reviewers, funding agencies and journals. 
Calling upon investigators to provide key information about the design, 
execution and analysis of animal experiments described in grant appli- 
cations and manuscripts and encouraging reviewers to consider these 
issues in their evaluations should, over time, increase both the quality 
and predictive value of preclinical research. Potential strategies for 
achieving this goal can be adopted from the clinical trials community, 
which also contended with poor reporting and associated bias. Evidence 
that clinical trials can yield biased results if they lack methodological 
rigor*’-** led to the development and implementation of the CONSORT 
guidelines for randomized clinical trials (among other guidelines), now 
adopted by many clinical journals and funding organizations. These 
guidelines require that authors report whether and how their studies were 
carried out blind and randomized, how sample size was determined, 
whether data are missing owing to attrition or exclusion, and supply 
information about other important experimental parameters*™. 
Importantly, the guidelines have improved the transparency of clinical 
study reporting in journals that have adopted them**’. Additional evid- 
ence for the power of such guidelines can be deduced from the obser- 
vation that, although few animal studies report on randomization, 
blinding or sample-size determination, most describe compliance with 
animal regulations, which is required by journals®??°°?*?, 

As a first step, we recommend that funding organizations and journals
provide reviewers with clear guidance about core features of animal study 
design (listed in Box 1). The goal is not to be prescriptive or proscrip- 
tive, but rather to delineate the minimum set of standards that should 
routinely be considered in evaluating the appropriateness of a study. Such 
guidance would make the task easier for reviewers of manuscripts and 
grant applications who volunteer their time and are often overextended. 
In addition, investigators and reviewers should be encouraged to consult 
published generic and model-specific guidelines for designing in vivo 
animal experiments®’’’”°***. To assist reviewers, editors and funding 
organizations in making sure that applications and manuscripts contain 
sufficient information on the core reporting recommendations (Box 1),
authors could be asked to append relevant information on a standardized
form that accompanies the submission. This form could be as simple as a 
checkbox indicating the page on which the key reporting standard is 
addressed. Such a form is currently used by clinical research journals. 

In addition to the measures proposed above, better dissemination 
of knowledge will be greatly facilitated by addressing publication bias, 
the phenomenon that few studies showing negative outcomes are 
published****. Such deficiency in reporting contributes to needless repe- 
tition of similar studies by investigators unaware of earlier efforts’. 
There is a widely accepted belief that the scientific community, promo- 
tions committees, funding agencies and journals favour positive out- 
comes, an impression that can lead to bias“. Possible solutions include 
incentivizing investigators to publish negative outcomes, supporting 
studies of independent replication, encouraging journals to publish a 
greater number of studies reporting negative outcomes, creating a data- 
base for negative outcomes (analogous to http://ClinicalTrials.gov/), 
and linking the raw data to publications. 

Change will not occur overnight. The importance of training scientists 
to properly design and adequately report animal studies cannot be overstated.
Training and education focused on key features of experimental
design should be an ongoing process for both the novice and veteran
involved in biomedical research. Attention to better study design reporting 
should be communicated at major meetings, brought to the attention of 
reviewers, editors and funders, required by the publishers of peer-review 
journals, and included in the training program of graduate and postdoc- 
toral students. Furthermore, good mentorship is crucial for developing 
such skills and should be encouraged and rewarded. Rigorous experimental
design and adequate reporting need to be emphasized across the
board and monitored in training grants awarded by the US National
Institutes of Health (NIH) and other funding agencies. Professional
societies can also have an important role by highlighting this issue in their
respective communities. 

An important gatekeeper of quality remains the peer review of grant 
applications and journal manuscripts. We therefore call upon funding 
agencies and publishing groups to take actions to reinforce the import- 
ance of methodological rigor and reporting. NINDS has begun taking 
steps to promote best practices for preclinical therapy development 
studies. In 2011, a Notice was published in the NIH Guide encouraging 
the scientific community to address the issues described above in their 
grant applications, in describing both the project being proposed and the 
supporting data upon which it is based
(http://grants.nih.gov/grants/guide/notice-files/NOT-NS-11-023.html).
Points that should be considered in a well-designed study are listed on the
NINDS website
(http://www.ninds.nih.gov/funding/transparency_in_reporting_guidance.pdf).
Furthermore, the reviewers of applications reviewed by the NINDS 
Scientific Review Branch are reminded of these issues and asked to pay 
careful attention to the scientific premise of the proposed projects. 

We believe that improving how animal studies are reported will raise 
awareness of the importance of rigorous study design. Such increased 
awareness will accelerate both scientific progress and the development 
of new therapies.


A roadmap for graphene


K. S. Novoselov¹, V. I. Fal’ko², L. Colombo³, P. R. Gellert⁴, M. G. Schwab⁵ & K. Kim⁶


Recent years have witnessed many breakthroughs in research on graphene (the first two-dimensional atomic crystal) as 
well as a significant advance in the mass production of this material. This one-atom-thick fabric of carbon uniquely 
combines extreme mechanical strength, exceptionally high electronic and thermal conductivities, impermeability to 
gases, as well as many other supreme properties, all of which make it highly attractive for numerous applications. Here 
we review recent progress in graphene research and in the development of production methods, and critically analyse
the feasibility of various graphene applications.


Could graphene become the next disruptive technology, replacing
some of the currently used materials and leading to new markets?
Is it versatile enough to revolutionize many aspects of our life
simultaneously? In terms of its properties, graphene certainly has the
potential. Graphene is the first two-dimensional (2D) atomic crystal
available to us. A large number of its material parameters, such as mechanical
stiffness, strength and elasticity, very high electrical and thermal
conductivity, and many others (refs 1, 2), are supreme. These properties suggest that
graphene could replace other materials in existing applications. However,
that all these extreme properties are combined in one material means that
graphene could also enable several disruptive technologies. The combination
of transparency, conductivity and elasticity will find use in flexible
electronics, whereas transparency, impermeability and conductivity will
find application in transparent protective coatings and barrier films;
and the list of such combinations is continuously growing. However, is
graphene special and versatile enough to justify the inconveniences of
switching to a new technology, usually a lengthy and expensive process?


Graphene properties 


One reason that graphene research has progressed so fast is that the 
laboratory procedures enabling us to obtain high-quality graphene are 
relatively simple and cheap. Many graphene characteristics measured 
in experiments have exceeded those obtained in any other material, 
with some reaching theoretically predicted limits: room-temperature 
electron mobility of 2.5 × 10⁵ cm² V⁻¹ s⁻¹ (ref. 3) (theoretical limit
~2 × 10⁵ cm² V⁻¹ s⁻¹; ref. 4); a Young’s modulus of 1 TPa and intrinsic
strength of 130 GPa (ref. 5, very close to that predicted by theory; ref. 6); very
high thermal conductivity (above 3,000 W m⁻¹ K⁻¹; ref. 7); optical absorption
of exactly πα ≈ 2.3% (in the infrared limit, where α is the fine
structure constant; ref. 8); complete impermeability to any gases (ref. 9); and the ability to
sustain extremely high densities of electric current (a million times
higher than copper; ref. 10). Another property of graphene, already
demonstrated (refs 11–13), is that it can be readily chemically functionalized.
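
As a quick check on the optical figure quoted above, the following sketch evaluates πα numerically. The value used for the fine structure constant is an approximate standard value, and treating the transmittance as one minus the absorption (that is, neglecting reflection) is a simplifying assumption made here.

```python
import math

# Approximate value of the fine structure constant (dimensionless).
FINE_STRUCTURE_CONSTANT = 1 / 137.035999


def monolayer_absorption():
    """Fraction of normally incident light absorbed by one graphene layer."""
    return math.pi * FINE_STRUCTURE_CONSTANT


absorption = monolayer_absorption()
# Assumption: reflection is neglected, so transmittance = 1 - absorption.
transmittance = 1.0 - absorption

print(f"absorption    = {absorption:.2%}")     # close to the quoted 2.3%
print(f"transmittance = {transmittance:.1%}")  # close to 97.7% per layer
```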

Graphene’s many superior properties justify its nickname of a 
‘miracle material’. However, some of these characteristics have been 
achieved only for the highest-quality samples (mechanically exfoliated 
graphene") and for graphene deposited on special substrates like 
hexagonal boron nitride*’*. As yet, equivalent characteristics have not 
been observed on graphene prepared using other techniques, although 
these methods are rapidly improving. Graphene will be of even greater 
interest for industrial applications when mass-produced graphene has 
the same outstanding performance as the best samples obtained in 
research laboratories. 


Nature provides us with many other 2D crystals, such as boron nitride 
and molybdenum disulphide’®. Being structurally related to graphene 
but having their own distinctive properties, they offer the possibility of 
fine-tuning material and device characteristics to suit a particular 
technology better or to be used in combination with graphene (for 
example, 2D-based heterostructures'”'*). Being part of such a large 
and diverse family of 2D crystals and heterostructures will improve 
graphene’s chances of commercial success, although we do not cover 
these other 2D crystals in this Review (see Box 1). 


Challenges in production 


The market of graphene applications is essentially driven by progress in 
the production of graphene with properties appropriate for the specific 
application, and this situation is likely to continue for the next decade or 
at least until each of graphene’s many potential applications meets its 
own requirements. Currently, there are probably a dozen methods being 
used and developed to prepare graphene of various dimensions, shapes 
and quality. Here we concentrate only on those that are scalable. 

It is logical to categorize these by the quality of the resulting graphene 
(and thus the possible applications): (1) graphene or reduced graphene 
oxide flakes for composite materials, conductive paints, and so on; (2) 
planar graphene for lower-performance active and non-active devices; 
and (3) planar graphene for high-performance electronic devices. The 
properties of a particular grade of graphene (and hence the pool of 
applications that can utilize it) depend very much on the quality of 
the material, type of defects, substrate, and so forth, which are strongly 
affected by the production method; see Fig. 1 and Table 1. 


Liquid phase and thermal exfoliation 

Liquid-phase exfoliation of graphite'*”® (or any other layered material’’) 
is based on exposing the materials to a solvent with a surface tension that 
favours an increase in the total area of graphite crystallites. The solvent is 
typically non-aqueous, but aqueous solutions with surfactant can also be 
used. With the aid of sonication, graphite splits into individual platelets, 
and prolonged treatment yields a significant fraction of monolayer flakes 
in the suspension, which can be further enriched by centrifugation. 

A related method is the graphite oxide route in which graphite pellets 
are first oxidized and then ultrasonically exfoliated in an aqueous solu- 
tion”. After exfoliation of graphite oxide the suspension may be further 
processed by centrifugation, and can then be deposited as a thin film on 
almost any surface and reduced (albeit partially) in situ back to the 
parent graphene state.

BOX 1
Off-road with other 2D atomic 
crystals and their heterostructures 


The study of graphene has triggered experiments on many other 2D 
atomic crystals, such as BN, NbSe2, TaS2, MoS2 and many others. 
Similar strategies to those applied to graphene can be used to obtain 
new 2D materials by mechanical?® or liquid-phase exfoliation of 
layered materials? or CVD growth. Another way to create new 2D 
crystals is to start with an existing one (like graphene) and use it as an 
atomic scaffolding, modifying it by chemical means (graphane?? and 
fluorographene?’ are good examples). The pool of possible 2D crystals 
is huge, covering a massive range of properties: from the most 
insulating to the most conductive, from the strongest to the softest. 

If 2D materials provide a large range of different properties, 
sandwich structures (made up of two, three, four or more different 
layers of such materials) can offer even greater scope. These 2D-based 
heterostructures!”"® can be tailored with atomic precision and 
individual layers of very different character can be combined, so the 
properties of these structures can be tuned to fit an enormous range of
possible applications. Furthermore, the functionality of
heterostructure stacks is ‘embedded’ in their design. The first
examples have already started to appear, with vertical tunnelling
transistors based on this type of heterostructure having been
demonstrated recently, showing very promising characteristics®?. 


Box 1 Figure | Example of optically active 2D-based 
heterostructure. Two graphene layers are separated by several 
layers of boron nitride, which serve as a tunnelling barrier. A 
built-in electric field (created by the proximity of one of the 
graphene layers to a monolayer of MoS2) separates the electron-
hole pair, which is created by an incoming photon, resulting in a 
photocurrent. 


An industrially important variation of the fully aqueous-based 
graphite-oxide route makes use of a thermal-shock procedure to achieve 
exfoliation and reduction simultaneously*’. Even though the resulting 
material may contain graphene components with several layers, it still 
preserves many of the appealing properties of single-layer graphene. 
Similarly to oxidation, the parent graphite stacking can be disturbed 
via intercalation of small molecules. Such graphite intercalation 
compounds may then be used in a similar way as precursors and can 
subsequently be subjected to thermal or plasma processes to achieve 
their delamination into single sheets. 

Also, there are several methods of producing suspensions of graphene 
nanoribbons—via unzipping of single-wall carbon nanotubes”. 
Although they are more expensive than chemical exfoliation of graphite 
or graphite oxide, these methods allow one to achieve suspensions with 
well-defined distributions (potentially very narrow) of graphene 
platelets. Similarly, nanotube unzipping allows better control over the 
chemical functionalization and quality of the edges. 

Such bulk grades of graphene are already available on the tonne scale 
and are currently being evaluated in numerous fields of application”®. 
Thus, graphene-based paints and inks will find their way into printed
electronics, electromagnetic shielding, barrier coatings, heat dissipation,
supercapacitors, smart windows”, and so on. A number of flake-based 
products can be expected in the marketplace within a few years, and 
prototype applications for conductive inks have already been demon- 
strated on the commercial level. 


Chemical vapour deposition 

Large-area uniform polycrystalline graphene films are now being grown 
by chemical vapour deposition (CVD) on copper foils and films, and 
show promise for many applications”*. Despite the fact that the complete 
process typically requires transfer from the copper support to a dielectric 
surface or other substrate of interest’, the production of square metres of 
graphene has already been achieved”. These films have also been trans- 
ferred onto 200-mm Si wafers on which state-of-the-art devices have been 
demonstrated. On a smaller scale, these films show transport properties 
equivalent to those of exfoliated graphene on both SiO2 and hexagonal
boron nitride substrates. Despite the presence of defects, grain boundaries, 
inclusions of thicker layers, and so on, such films are ready for use in 
transparent conductive coating applications (such as touch screens). 

At present, the process is expensive owing to large energy consumption 
and because the underlying metal layer has to be removed. However, once 
the transfer process is optimized this method may indeed be disruptive and 
cost-effective. A number of issues need to be resolved before graphene 
CVD technology can become widely used. Graphene growth on thin (tens 
of nanometres) films of metals needs to be achieved, simultaneously gain- 
ing control of the domain (grain) size, ripples, doping level and the number 
of layers. Control of the number and relative crystallographic orientation 
of the graphene layers is critical because it will enable a number of applica- 
tions which would require double, triple and even thicker layers of gra- 
phene. Simultaneously, the transfer process should be improved and 
optimized with the objectives of minimizing the damage to graphene 
and of recovering the sacrificial metal. 

The transfer process might be as complicated as the growth of 
graphene itself. However, there are a number of applications which rely 
on conformal growth of graphene on the surface of the metal, and do not 
require graphene transfer at all: high thermal and electrical conductivities 
as well as excellent barrier properties allow graphene greatly to enhance 
the performance of copper interconnects in integrated circuits. Also, 
because graphene is inert, it is an excellent barrier for any gas, and it 
forms a conformal layer on metal surfaces with the most complex 
topographies: such coatings can protect against corrosion. 

The game-changing breakthroughs would be the development of 
graphene growth on arbitrary surfaces and/or at low temperatures 
(for example, using plasma-enhanced CVD or other methods) with a 
minimal number of defects. The former would allow one to avoid the 
complex and expensive transfer step and promote better integration of 
this 2D crystal with other materials (like Si or GaAs). The latter would 
improve compatibility with modern microelectronic technologies and 
allow significant energy saving. 


Synthesis on SiC 

Silicon carbide is a common material used for high-power electronics. It 
has been demonstrated that graphitic layers can be grown either on the 
silicon or carbon faces of a SiC wafer by sublimating Si atoms, thus 
leaving a graphitized surface’. Initially, the C-terminated face of SiC 
was used to grow a turbostratic stack of many randomly oriented 
polycrystalline layers*’, but now the number of graphene layers grown” 
can be controlled. The quality of such graphene can be very high, with 
crystallites approaching hundreds of micrometres in size**. 

The two major drawbacks of this method are the high cost of the SiC 
wafers and the high temperatures (above 1,000 °C) used, which are not 
directly compatible with silicon electronics technology. There are 
potentially several ways to take advantage of the growth of graphene 
on SiC, including the growth of thin SiC on Si, although this approach 
requires further development. As a result of the high-temperature growth, 
high substrate cost, and small-diameter wafers, the use of graphene on SiC
will probably be limited to niche applications. High-frequency transistors
based on SiC-grown graphene** may well find applications within a
decade when the existing technology, based on III-V materials (such as
InGaAs, GaN, and so on), reaches its limit at about 1 THz. The short gate
transistors that are currently widely used make even the 20-µm size
domains (currently achieved in graphene grown on SiC) suitable for such
applications. Another very attractive, though niche, application of this
type of graphene is in metrological resistance standards*, where samples
of graphene grown on SiC have already been demonstrated to deliver
higher resistance accuracy at higher temperatures than do conventionally
used GaAs heterostructures.

Figure 1 | There are several methods of mass-production of graphene,
which allow a wide choice in terms of size, quality and price for any
particular application.

Apart from the high temperature required for growth, which cur- 
rently seems to be an insurmountable problem, the other issues that 
need to be addressed in the next decade are the elimination of terraces, 
the growth of the second or third layers at the edges of the terraces 
(which also strongly contribute to carrier scattering), an increase in 
the size of the crystallites and control of unintentional doping from 
the substrate and buffer layers. 


Other growth methods 

Although there are a number of other growth methods, it is unlikely that 
they will become commercially viable in the next decade. Nevertheless, 
some of these methods have certain advantages and should be 
researched further. Surface-assisted coupling of molecular monomer 
precursors into linear polyphenylenes with subsequent cyclodehydro- 
genation is an exciting way to create high-quality graphene nanoribbons 
and even more complex structures (like T- and Y-shaped connections)** 
using a chemistry-driven bottom-up approach. Molecular beam epitaxy 
has been used to grow chemically pure graphene”, but it is unlikely to be 
used on a large scale because of its much higher cost than CVD methods. 


Table 1 | Properties of graphene obtained by different methods 


Laser ablation is a potentially interesting growth technique allowing the 
deposition of graphene nanoplatelets on arbitrary surfaces**. This rela- 
tively expensive method is in direct competition with the spray-coating 
of chemically exfoliated graphene, so it is unlikely to be widely used. 


Graphene electronics 

It is unlikely that graphene will make it into high-performance inte- 
grated logic circuits as a planar channel material within the next decade 
because of the absence of a bandgap. However, many other, less strin- 
gent, graphene electronic applications are being developed, using the 
available (probably not ideal in terms of quality) material. Figure 2 and 
Table 2 list some of the possible applications and the time that it may 
take for graphene-based prototypes to be demonstrated. 


Flexible electronics 

Transparent conductive coatings are widely used in electronic products 
such as touch screen displays, e-paper (electronic paper) and organic 
light-emitting diodes (OLEDs) and require a low sheet resistance with 
high transmittance (of over 90%) depending on the specific application. 
Graphene meets the electrical and optical requirements, with a sheet resistance
reaching 30 Ω per square of 2D area in highly doped samples and an
excellent transmittance of 97.7% per layer (ref. 8), although the traditionally used
indium tin oxide (ITO) still demonstrates slightly better characteristics.
However, considering that the quality of graphene improves every year 
(already making the difference in performance marginal), while ITO will 
become more expensive and ITO deposition is already expensive, gra- 
phene has a chance of securing a good fraction of the market. Graphene 
also has outstanding mechanical flexibility and chemical durability—very 
important characteristics for flexible electronic devices”, in which ITO 
usually fails. 
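
The transmittance requirement also limits how many graphene layers can be stacked to lower the sheet resistance. The sketch below is a rough estimate only: it assumes that each layer independently absorbs a fraction πα of the light and that reflection and interlayer effects can be ignored. The function names are hypothetical.

```python
import math

# Approximate value of the fine structure constant (dimensionless).
ALPHA = 1 / 137.035999


def stack_transmittance(layers):
    """Estimated transmittance of a stack of independent graphene layers.

    Assumption: every layer absorbs a fraction pi * alpha of the incoming
    light and nothing is reflected.
    """
    return (1.0 - math.pi * ALPHA) ** layers


def max_layers(min_transmittance):
    """Largest layer count whose estimated transmittance meets the target."""
    if not 0.0 < min_transmittance <= 1.0:
        raise ValueError("min_transmittance must be in (0, 1]")
    layers = 0
    while stack_transmittance(layers + 1) >= min_transmittance:
        layers += 1
    return layers


for n in range(1, 6):
    print(n, f"{stack_transmittance(n):.1%}")

print("layers allowed at 90% transmittance:", max_layers(0.90))
```

Under these assumptions only a few layers fit within a 90% transmittance budget, which is one reason doping, rather than thickness alone, is used to reduce the sheet resistance.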

The requirements of electrical properties (for example, sheet resist- 
ance) for each electrode type differ from application to application. 
Depending on the production methods, various grades of transparent 
conductive coating could be produced from graphene. Thus, electrodes 
for touch screens (although requiring an expensive CVD method of 
production) tolerate a relatively high sheet resistance (50-300 Ω per
square) for a transmittance of 90%. The advantage of graphene electro- 
des in touch panels is that graphene’s endurance far exceeds that of any 
other available candidate at the moment. Moreover, the fracture strain 
of graphene is ten times higher” than that of ITO, meaning that it could 
also successfully be applied to bendable and rollable devices. 

Rollable e-paper is a very appealing electronic product. It requires a 
bending radius of 5-10 mm, which is easily achievable by a graphene 
electrode. In addition, graphene’s uniform absorption across the visible 
spectrum‘ is beneficial for colour e-papers. However, the contact resist- 
ance between the graphene electrode and the metal line of the driving 
circuitry is still a problem. A working prototype is expected by 2015, but 
the manufacturing cost needs to decrease before it will appear on the 
market. 

OLED devices have become an attractive technology and the first
(non-graphene) products are expected on the market by 2013. Besides
strict requirements for the sheet resistance (below 30 Ω per square),
other crucial parameters for such devices are the work function and
the electrode’s surface roughness, which effectively governs the
performance. The tunability of graphene’s work function could improve
the efficiency, and its atomically flat surface would help avoid electrical
shorts and leakage currents. Graphene electrodes have already been
demonstrated in OLED test cells. Advanced flexible or foldable
OLED devices could be introduced after 2016 once device integration
issues (such as conformal deposition of graphene on three-dimensional
structures and contact resistance between graphene and the source/drain)
are resolved.

In the low-cost sector everything is set up for mass production.
Liquid-phase exfoliation produces such graphene coatings without the use of
expensive vacuum technology. Although the resistance of these films is
on the high side, they still perform well enough for smart windows, solar
cells and some touch screen applications. Graphene has the important
advantage of flexibility and mechanical strength, which ensures that
graphene-based devices will probably dominate flexible applications.


Table 1
Method | Crystallite size (µm) | Sample size (mm) | Charge carrier mobility at ambient temperature (cm² V⁻¹ s⁻¹) | Applications
Mechanical exfoliation | >1,000 | >1 | >2 × 10⁵ and >10⁶ (at low temperature) | Research
Chemical exfoliation | <0.1 | Infinite as a layer of overlapping flakes | 100 (for a layer of overlapping flakes) | Coatings, paint/ink, composites, transparent conductive layers, energy storage, bioapplications
Chemical exfoliation via graphene oxide | ~100 | Infinite as a layer of overlapping flakes | 1 (for a layer of overlapping flakes) | Coatings, paint/ink, composites, transparent conductive layers, energy storage, bioapplications
CVD | 1,000 | ~1,000 | 10,000 | Photonics, nanoelectronics, transparent conductive layers, sensors, bioapplications
SiC | 50 | 100 | 10,000 | High-frequency transistors and other electronic devices


Figure 2 | Graphene-based display and electronic devices. Display
applications are shown in green; electronic applications are shown in blue.
Possible application timeline, based on projections of products requiring
advanced materials such as graphene. The figure gives an indication of when a
functional device prototype could be expected based on device roadmaps and
the development schedules of industry leaders.


High-frequency transistors

Graphene has been considered and intensively researched for
high-frequency transistor applications**. However, it has to compete against
more mature technologies such as compound semiconductors (III-V
materials). Thus, graphene will probably be used only after 2021, when
even III-V materials will fail to satisfy device requirements. Projections
show that III-V materials will no longer be able to obtain the required
cut-off frequency f_T = 850 GHz (the top frequency for current
modulation) and maximum oscillation frequency f_max = 1.2 THz (the top
frequency for power modulation) after 2021 because device
requirements will become more stringent. A recent graphene progress report’
presented a value of f_T as high as 300 GHz, with the possibility of
extending it up to 1 THz at a channel length of about 100 nm (ref. 41). On the
other hand, f_max has only reached 30 GHz in traditional graphene
structures, which is far from the 330 GHz Si high-frequency transistor
performance, according to the 2011 International Technology Roadmap for
Semiconductors (ITRS). Thus, the principal remaining research issue is
the low value of f_max for graphene transistors, which trails f_T by an order
of magnitude in a comparable conventional device. There are two ways
to improve f_max: by lowering the gate resistance or the source-drain
conductance at pinch-off (ref. 42). The former approach could be done
using well-established semiconductor processes. The latter will require
current saturation in the graphene high-frequency transistor, which will
probably involve finding a new dielectric layer with properties similar to
those of boron nitride* and compatible with modern semiconductor
technology. An f_max of 58 GHz has been reported“* using graphene on
top of an exfoliated hexagonal boron nitride film*"’.


Table 2 | Electronics applications of graphene


Logic transistor 

It is widely accepted that Si technology will be extended to nearly or even 
below the 10-nm level. Graphene transistors might have an opportunity 
to replace the silicon technology only after 2020 (according to the 2011 
ITRS). 

Several research paths are being targeted at opening a bandgap in 
graphene: nanoribbon***’ and single electron transistor***” formation, 
bilayer control’*** and chemically modified graphene''’’. However, all 
of these approaches (apart from chemical modification) have so far been 
unable to open a bandgap wider than 360 meV (ref. 49), which limits the
on/off ratio to about 10³, much less than the required 10⁶. Even worse,
they also lead to the degradation of the carrier mobility in graphene”. 
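
The link between bandgap and on/off ratio can be sketched with a simple thermal-activation estimate. The relation used here, ratio ≈ exp(E_g / 2k_BT), is an assumption introduced for illustration and is not taken from this Review; real devices depend on geometry, contacts and gating. The function names are hypothetical.

```python
import math

BOLTZMANN_EV_PER_K = 8.617333e-5  # Boltzmann constant in eV per kelvin


def on_off_estimate(bandgap_ev, temperature_k=300.0):
    """Rough on/off ratio for a given bandgap.

    Assumption: the off-state current is thermally activated across half
    the bandgap, so the ratio scales as exp(E_g / (2 k_B T)).
    """
    return math.exp(bandgap_ev / (2.0 * BOLTZMANN_EV_PER_K * temperature_k))


def bandgap_for_ratio(ratio, temperature_k=300.0):
    """Bandgap (eV) needed to reach a target ratio under the same assumption."""
    return 2.0 * BOLTZMANN_EV_PER_K * temperature_k * math.log(ratio)


ratio_at_360_mev = on_off_estimate(0.36)
gap_for_million = bandgap_for_ratio(1e6)

print(f"on/off ratio at 360 meV: about {ratio_at_360_mev:.0f}")
print(f"bandgap for a ratio of one million: about {gap_for_million:.2f} eV")
```

Under this assumption a 360 meV gap gives a ratio of roughly a thousand at room temperature, and a ratio of a million would need a gap of roughly 0.7 eV, which is consistent with the scale of the shortfall described above.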

The issue of the low on/off ratio is resolved in new transistor
designs that exploit the modulation of the work function of graphene,
gaining control over vertical (rather than planar) transport
through various barriers”. Although such devices allow for spectacular
on/off ratios of >10⁶, more work on integration is required to enable the
use of graphene for logic applications after 2025.

Graphene’s electrical and thermal conductivities as well as its excel- 
lent barrier properties might push this material towards being used as 
interconnects as well as for thermal dissipation in integrated circuits. 
Graphene can easily be grown on copper by CVD, so we might see this 
combination used for such applications. 


Application / Drivers

Touch screen
Drivers: Graphene has better endurance than benchmark materials.
Requires better control of contact resistance, and the sheet
resistance needs to be reduced (possibly by doping).

E-paper
Drivers: High transmittance of monolayer graphene could provide visibility.
Requires better control of contact resistance.

Foldable OLED
Drivers: Graphene of high electronic quality has a bendability of below 5 mm,
improved efficiency due to graphene’s work function tunability,
and the atomically flat surface of graphene helps to avoid electrical
shorts and leakage current.
Requires better control of contact resistance, the sheet resistance
needs to be reduced, and conformal coverage of three-dimensional
structures is needed.
Figure 3 | Graphene-based photonics applications. Optical applications are
shown in pink; optical interconnect applications are shown in brown. Possible
application timeline, enabled by continued advances in graphene technologies,
based on projections of products requiring advanced materials such as
graphene. The figure gives an indication of when a functional device prototype
could be expected based on device roadmaps and the development schedules of
industry leaders.


Photonics


Electrons in graphene behave as massless two-dimensional particles,
which leads to a significant wavelength-independent absorption
(πα = 2.3%) for normal incident light below about 3 eV. Additionally,
mono- and bi-layer graphene become completely transparent when the
optical energy is smaller than double the Fermi level, owing to Pauli
blocking. These properties would suit many controllable photonic
devices (Fig. 3 and Table 3).


Photodetectors
Graphene photodetectors are presently one of the most actively studied
photonic devices. Unlike semiconductor photodetectors, which have
limited detecting spectral width, graphene can in principle be used for
a wide spectral range from ultraviolet to infrared. Another advantage of
graphene is its high operating bandwidth, which makes it suitable for
high-speed data communications. The maximum bandwidths of InGaAs
(for optical communication) and Ge (for optical interconnection)
photodetectors are limited to 150 GHz (ref. 53) and 80 GHz (ref. 54)
respectively, owing to the carrier transit times. The high carrier mobility of
graphene enables ultrafast extraction of photo-generated carriers,
possibly allowing extremely high bandwidth operation. The
transit-time-limited bandwidth of graphene photodetectors is calculated to
be 1.5 THz at the reported saturation carrier velocity. In practice, the
maximum bandwidth of a graphene photodetector would be limited to
640 GHz by the time constant resulting from the capacitive (RC) delay,
rather than the transit time.

Owing to the absence of a bandgap, the graphene photodetector
requires a different carrier extraction model from that of semiconductor
photodetectors. Currently, graphene photodetectors use the local
potential variation near the metal-graphene interfaces to extract the
photo-generated carriers. Photo-responses of up to 40 GHz (ref. 55) and
10 GHz (ref. 58) detector operation have been demonstrated.
However, the maximum responsivity is low (a few mA W⁻¹; ref. 58,
in comparison to the required ~1 A W⁻¹) because of the limited
absorption caused by the small effective detection areas and the thinness
of graphene.
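
The gap between the measured and required responsivity can be put in numbers with a short calculation. The sketch below assumes a single pass of normally incident light through one graphene layer, an idealized internal efficiency of 1 (every absorbed photon yields one collected carrier), and a telecom wavelength of 1,550 nm; the wavelength, the efficiency and the function names are illustrative assumptions rather than values from this review.

```python
import math

ALPHA = 7.2973525693e-3   # fine-structure constant (dimensionless)
H = 6.62607015e-34        # Planck constant, J s
C = 299_792_458.0         # speed of light, m/s
Q = 1.602176634e-19       # elementary charge, C


def monolayer_absorption() -> float:
    """Universal absorption of monolayer graphene, pi * alpha (about 2.3%)."""
    return math.pi * ALPHA


def responsivity_a_per_w(wavelength_m: float,
                         absorbed_fraction: float,
                         internal_efficiency: float = 1.0) -> float:
    """Photocurrent per unit optical power: eta * q / (photon energy)."""
    photon_energy_j = H * C / wavelength_m
    return absorbed_fraction * internal_efficiency * Q / photon_energy_j


if __name__ == "__main__":
    absorbed = monolayer_absorption()
    r = responsivity_a_per_w(1550e-9, absorbed)  # assumed wavelength
    print(f"absorption: {absorbed * 100:.2f} %")
    print(f"single-pass upper bound: {r * 1e3:.1f} mA/W")
```

Even this idealized single-pass bound stays in the tens of mA W⁻¹, which is why schemes that lengthen the light-graphene interaction or concentrate the optical field are needed to approach 1 A W⁻¹.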

There are several possible ways to improve the sensitivity of graphene
photodetectors, such as by using plasmonic nanostructures for the
enhancement of the local optical electric field or by integrating it with
a waveguide to increase the light-graphene interaction length. Given
the maximum bandwidth of the Ge photodetector and the optical
interconnection roadmap, a graphene photodetector with a bandwidth over
100 GHz will be competitive after 2020, provided that a method of growing
high-quality graphene (with mobility >20,000 cm² V⁻¹ s⁻¹) that is
compatible with modern semiconductor technology is secured.
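
The RC limit mentioned for graphene photodetectors follows from the standard single-pole relation f = 1/(2πRC). The sketch below uses that relation to show what time constant a 640 GHz cutoff implies; the 50 Ω load resistance and the function names are hypothetical and chosen only for illustration.

```python
import math


def rc_bandwidth_hz(resistance_ohm: float, capacitance_f: float) -> float:
    """3 dB cutoff of a single-pole RC circuit: f = 1 / (2 * pi * R * C)."""
    return 1.0 / (2.0 * math.pi * resistance_ohm * capacitance_f)


def rc_time_constant_s(bandwidth_hz: float) -> float:
    """RC time constant that corresponds to a given 3 dB cutoff."""
    return 1.0 / (2.0 * math.pi * bandwidth_hz)


if __name__ == "__main__":
    tau = rc_time_constant_s(640e9)
    load_ohm = 50.0  # assumed load resistance, not taken from the text
    print(f"time constant for 640 GHz: {tau * 1e12:.3f} ps")
    print(f"capacitance at {load_ohm:.0f} ohm: {tau / load_ohm * 1e15:.2f} fF")
```

A sub-picosecond time constant at a conventional load leaves room for only a few femtofarads of device capacitance, which indicates how tightly the contact resistance and device area must be controlled.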


Optical modulator 

Optical modulators are one of the key active building blocks for optical
interconnects; they are used to encode transmission data by altering
properties of light such as phase, amplitude and polarization using
electro-refraction or electro-absorption. Si optical modulators, such as
Mach-Zehnder interferometers, ring resonators and electro-absorption
modulators, are based on interference, resonance and bandgap absorption,
respectively. Their operating spectra are usually narrow, however,
and their slow switching times limit operation bandwidths. For Si
waveguide modulators, a large resistance in the p-n junction through the Si
core regions is a problem, confining bandwidths to usually less than about
50 GHz.

Excellent optical modulator performance can be achieved by exploiting
graphene’s ability to absorb a small amount of incident light over
ultrawide ranges of wavelengths and its ultrafast response. To do this,
the interband transitions of photo-generated electrons in a single
graphene layer are modulated over broad spectral ranges by a drive
voltage, leading to operating speeds with bandwidth exceeding 1 GHz in the
near-infrared range. With some structural changes, an even wider
operation bandwidth of more than 50 GHz has been suggested, using
inter-gated dual graphene layers to reduce the resistance in the RC
delay time, offering a pathway to a regime of hundreds of gigahertz,
although such developments are not expected before 2020. Graphene is
also promising for THz-range wireless communications where optical
losses are an order of magnitude smaller than those in noble metals.


Mode-locked laser/THz generator 

Ultrafast passively mode-locked lasers have been used for various
applications in spectroscopy, material micromachining, bio-medicine and
security; they usually use saturable absorbers to cause
intensity modulation by selectively transmitting high-intensity light only.
Compared with the widely used semiconductor saturable absorbers,
graphene absorbs a significant number of photons per unit thickness
and therefore reaches saturation at a lower intensity over a wide spectral
range. Ultrafast carrier relaxation time, controllable modulation
depth, high damage threshold, high thermal conductivity and wide
spectral range tunability are other benefits of graphene-saturable
absorbers. These applications need only a small area of graphene, so
commercialization could take place even before 2020.

Most studies are focused on fibre and solid-state lasers, but
graphene-saturable absorbers can also find applications in semiconductor
laser technology. Optical interconnection with a wavelength-division-
multiplexing scheme requires a laser array with different wavelengths.
One way to provide many different wavelengths is to use a single laser
with multiple longitudinal modes, such as a mode-locked laser. An
actively mode-locked Si hybrid laser has been studied for this purpose,
but a graphene-saturable absorber could enable a passively mode-locked
semiconductor laser with simple fabrication and operation. However, we
expect this application will be useful only after developing a highly
integrated optical interconnection around the late 2020s.
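
The idea of drawing many wavelength channels from one mode-locked laser rests on the longitudinal mode spacing of the cavity, which for a linear cavity is Δν = c / (2 n_g L). The sketch below evaluates this for a hypothetical semiconductor cavity; the cavity length, group index, usable gain bandwidth and function names are illustrative assumptions, not values from this review.

```python
import math

C = 299_792_458.0  # speed of light, m/s


def mode_spacing_hz(cavity_length_m: float, group_index: float) -> float:
    """Longitudinal mode spacing of a linear cavity: c / (2 * n_g * L)."""
    return C / (2.0 * group_index * cavity_length_m)


def modes_in_band(gain_bandwidth_hz: float, spacing_hz: float) -> int:
    """Number of whole mode spacings that fit inside the usable gain bandwidth."""
    return math.floor(gain_bandwidth_hz / spacing_hz)


if __name__ == "__main__":
    spacing = mode_spacing_hz(1e-3, 3.5)  # assumed 1 mm cavity, n_g = 3.5
    print(f"mode spacing: {spacing / 1e9:.1f} GHz")
    print(f"modes in an assumed 1 THz band: {modes_in_band(1e12, spacing)}")
```

With these assumed values a millimetre-scale cavity yields modes a few tens of gigahertz apart, so a single device could in principle supply a few dozen channels.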

THz generators can be used in various applications such as medical
imaging, chemical sensors, and security devices. Early proposals for
THz electromagnetic wave generation use graphene as a gain
medium to generate stimulated emission by optical pumping. However,
electrons and holes have similar mobility values, so the photo-Dember
effect (formation of a dipole and resulting THz emission due to the
difference in diffusion times of electrons and holes) may not be effective.
Hence, it is difficult to obtain a continuous-wave operation overcoming
stimulated emission thresholds without damaging the material. Recent
studies on THz wave generation suggest using a pulsed excitation of
single-layer graphene or using multilayer graphite under a femtosecond
laser pulse field to generate carriers that will be accelerated to generate the
THz wave. However, the intensity is orders of magnitude weaker than that
generated from a III-V-semiconductor-based photoconductive antenna
or resonant tunnelling devices. Practical THz-wave generators using
graphene are unlikely to emerge before 2030.


Optical polarization controller 

Polarization controllers such as polarizers and polarization rotators are
crucial passive components with which to manipulate the polarization
properties of photons. The differential attenuation of the transverse
magnetic mode due to the excitation of Dirac fermions can provide an
excellent extinction ratio of 27 dB, covering very broad communication
bands. Compact optical polarizers have been demonstrated in
data-communication optical fibres integrated with graphene as an in-line
conductive layer. High-quality millimetre-sized graphene needs to
be integrated with an optical fibre or silicon in a hybrid device.
Therefore, if the graphene-processing technology matures, these devices
could come into play as early as 2020.
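
For readers less used to decibel figures, the 27 dB extinction ratio quoted for graphene polarizers can be converted into a linear power ratio with the usual definition ER(dB) = 10 log₁₀(P_pass / P_block). The function names below are illustrative.

```python
import math


def db_to_power_ratio(extinction_db: float) -> float:
    """Convert an extinction ratio in dB to a linear power ratio."""
    return 10.0 ** (extinction_db / 10.0)


def power_ratio_to_db(ratio: float) -> float:
    """Convert a linear power ratio back to dB."""
    return 10.0 * math.log10(ratio)


if __name__ == "__main__":
    ratio = db_to_power_ratio(27.0)
    print(f"27 dB corresponds to a power ratio of about {ratio:.0f}:1")
    print(f"residual unwanted polarization: {100.0 / ratio:.2f} % of the passed power")
```

In other words, the suppressed polarization carries roughly 0.2% of the power of the transmitted one.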

Faraday rotation is a popular way to manipulate light polarization.
Landau quantization in the two-dimensional electron gas in graphene
results in a giant rotation with a fast response and broadband
tunability. Even larger polarization rotations can be achieved with
multi-stacking graphene structures. Two polarizers combined with these
Faraday rotators could be made into very compact hybrid isolators.
However, achieving this with a desirable magnetic field smaller than 1 T
will be a serious challenge for graphene isolators, delaying their debut
until the late 2020s.


Composite materials, paints and coating 


Graphene-based paints can be used for conductive ink, antistatic, 
electromagnetic-interference shielding, and gas barrier applications. 
In principle, the production technology is simple and reasonably 
developed, with most of the major graphite mining companies as well 
as new start-up companies having programmes on liquid-phase or ther- 
mally exfoliated graphene. In addition, over the next few years chemical 
derivatives of graphene will be heavily developed to control the conduc- 
tivity and optical opacity of the products. 

Graphene is highly inert, and so can also act as a corrosion barrier 
against water and oxygen diffusion. Given that it can be grown directly 
on the surface of almost any metal under the right conditions, it could 
form a protective conformal layer, that is, it could be used on complex 
surfaces. 

The mechanical, chemical, electronic and barrier properties of gra- 
phene along with its high aspect ratio make graphene attractive for 
applications in composite materials. The commercial position held by 
carbon fibres, however, is so strong that graphene will need substantial 
development before it will be economically feasible to use it as the main 
reinforcement component. The target is to achieve a 250 GPa Young’s 
modulus at the price of €25 per kilogram. In addition, pure graphene 
might not have the same adhesion properties to the matrix as carbon 
fibres, which would require more chemical modification of graphene. 
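
To see how a reinforcement modulus translates into composite stiffness, the simple rule of mixtures (the Voigt upper bound, E_c = V_f E_f + (1 - V_f) E_m) is a common first estimate. The sketch below reads the 250 GPa target as the modulus of the reinforcement and assumes perfect alignment and load transfer; the 3 GPa matrix modulus, the 60% volume fraction and the function name are hypothetical values chosen only for illustration.

```python
def rule_of_mixtures_gpa(volume_fraction: float,
                         filler_modulus_gpa: float,
                         matrix_modulus_gpa: float) -> float:
    """Voigt upper bound on composite modulus for aligned reinforcement."""
    if not 0.0 <= volume_fraction <= 1.0:
        raise ValueError("volume fraction must lie between 0 and 1")
    return (volume_fraction * filler_modulus_gpa
            + (1.0 - volume_fraction) * matrix_modulus_gpa)


if __name__ == "__main__":
    # Assumed values: 60 vol% reinforcement at 250 GPa in a 3 GPa polymer matrix.
    e_c = rule_of_mixtures_gpa(0.6, 250.0, 3.0)
    print(f"upper-bound composite modulus: {e_c:.1f} GPa")
```

Real composites fall below this bound when the filler is misaligned or adheres poorly to the matrix, which is why the adhesion issue raised in the text matters.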

An equally large market exists in bringing extra functionality to 
composites, where the scope of graphene might be large and possibly 
realized more rapidly. Graphene can contribute gas and moisture barrier 
properties, electromagnetic shielding, electrical and thermal conduc- 
tivity, and a strain monitoring capability to the surrounding polymer 
matrix. As an additive to a composite matrix polymer it might increase 
the operating temperature level of composites, reduce moisture uptake, 
induce antistatic behaviour, give lightning strike protection and 
improve composite compressive strength. There are also a number of 
applications for which it is difficult to use carbon fibres that would still 
benefit from excellent mechanical reinforcement (injection-moulded 
composites). 

Considering that many companies involved in the carbon business
have already established programs on graphene and graphene-oxide
production, it is possible to expect graphene-based composites to appear
on the market within a few years. The real breakthrough, however, will
be expected when graphene flakes over 10 µm in size are easily
obtainable, this being the dimension required to take full advantage of
the high Young’s modulus of graphene. Fortunately, it has been demonstrated
that graphite flakes thicker than one monolayer can provide a significant
level of reinforcement, thus making the implementation of graphene-
based composites realistic within a shorter time.


Energy generation and storage 

There is a constant search for highly efficient renewable energy tech- 
nologies, and it would be surprising if graphene were not involved in this 
race. At present, most efforts are concentrated on solar cells, which 
could be divided into those where graphene acts as the active medium 
and those that use graphene as a transparent or distributed electrode 
material. The former use the same principle of operation as already 
discussed for photodetectors, and, in principle, would benefit from
uniform absorption over a broad spectrum. However, owing to the low
intrinsic optical absorption of graphene, such devices would require
complex interferometry or plasmonic enhancement structures to
achieve the desired responsivity, and thus are unlikely to be widely used
soon. Instead, the use of graphene as a transparent electrode in either
quantum-dot or dye-sensitized solar cells has proved highly beneficial.
Doping can vary the position of the Fermi level in graphene
significantly, so such electrodes have been used both as electron- and
hole-conducting media. With the cost of graphene produced by liquid-phase
or thermal exfoliation going down, we can expect wide use of graphene
in dye solar cells, especially in applications where mechanical flexibility
is paramount.

The use of graphene in next-generation lithium-ion batteries is
currently being widely studied. The cathodes traditionally used in
commercial lithium-ion batteries frequently suffer from poor electrical
conductivity, which is overcome by the addition of graphite and carbon
black to the electrode formulation. Graphene, with its sheet-like
morphology, would not only act as an advanced conductive filler but may
also give rise to novel core-shell or sandwich-type nanocomposite
structures. The resulting increase in electrical conductivity of these new
morphologies would help in overcoming one of the key limitations of
lithium-ion batteries: their low specific power density. Lastly, the high
thermal conductivity of graphene may be advantageous when it comes
to high current loads that generate significant amounts of heat within
the battery system. As anodes, graphene nanosheets can be used to
intercalate lithium reversibly into the layered crystals. Graphene
nanosheets used in conjunction with carbon nanotubes and fullerenes,
C₆₀, increased the battery charge capacity.

Supercapacitors (Fig. 4) are based on storage of energy within
electrochemical double-layer capacitors. The superior rate performance of
state-of-the-art devices (compared to lithium-ion batteries) is based on
predominantly electrostatic storage of electrical energy and is
determined by the combination of a high-surface-area activated carbon
material and a nanoscopic charge separation at the electrode-electrolyte
interface. Graphene is an obvious material choice for this application,
offering high intrinsic electrical conductivity, an accessible and defined
pore structure, good resistance to oxidative processes and high
temperature stability. Currently the prototype graphene-based electrochemical
double-layer capacitors lead the field in capacitance as well as energy
and power densities. Although the characteristics of graphene
supercapacitors are very encouraging, there are still issues which must be
addressed before the commercial use of such systems. In particular,
the irreversible capacitance of graphene-based supercapacitors is still
too high, which could probably be improved by reducing the number of
defects or choosing a better electrolyte.

Figure 4 | In a supercapacitor device two high-surface-area graphene-based
electrodes (blue and purple hexagonal planes) are separated by a membrane
(yellow). Upon charging, anions (white and blue merged spheres) and cations
(red spheres) of the electrolyte accumulate at the vicinity of the graphene
surface. The ions are electrically isolated from the carbon material by the
electrochemical double layer that is serving as a molecular dielectric.
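
The role of surface area and nanoscopic charge separation can be made concrete with the simplest (Helmholtz, parallel-plate) picture of the double layer, C = ε₀ ε_r A / d, together with the stored energy E = ½CV². The sketch below is only an order-of-magnitude illustration: the relative permittivity, the 1 nm separation, the cell capacitance and voltage, and the function names are all hypothetical assumptions.

```python
EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def helmholtz_capacitance_f(area_m2: float,
                            relative_permittivity: float,
                            separation_m: float) -> float:
    """Parallel-plate estimate of double-layer capacitance: eps0 * eps_r * A / d."""
    return EPS0 * relative_permittivity * area_m2 / separation_m


def stored_energy_j(capacitance_f: float, voltage_v: float) -> float:
    """Electrostatic energy stored in a capacitor: 0.5 * C * V^2."""
    return 0.5 * capacitance_f * voltage_v ** 2


if __name__ == "__main__":
    # Assumed: 1 m^2 of accessible surface, eps_r = 10, 1 nm charge separation.
    c_per_m2 = helmholtz_capacitance_f(1.0, 10.0, 1e-9)
    print(f"areal capacitance: {c_per_m2 * 100:.2f} uF/cm^2")
    # Assumed: a 100 F cell charged to 2.7 V.
    energy = stored_energy_j(100.0, 2.7)
    print(f"stored energy: {energy:.1f} J ({energy / 3600:.3f} Wh)")
```

Because the separation is on the nanometre scale, each square metre of accessible surface contributes a large capacitance, which is why high-surface-area carbons such as graphene are attractive electrode materials.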

There are also reports on the use of graphene nanosheets as a support
material for platinum catalysts for fuel cells. Unlike carbon black,
which is the baseline support material for platinum catalysts, graphene
decreases the platinum particle size to under a nanometre because of the
strong interaction between the platinum atoms and graphene. The
strong interaction of platinum and graphene and the small particle size
lead to increased catalytic activity in direct methanol fuel cells.

Common benchmark materials in energy-related applications 
(graphite, carbon black and activated carbon) will only be replaced if 
graphene is proved to be superior in terms of both performance and cost. 
That graphene of suitable grades for such applications is already
available in scalable amounts might speed its progression into real devices.


Graphene for sensors and metrology 


Graphene, being a two-dimensional fabric and a surface without bulk, 
has properties that are extremely sensitive to the environment. Thus, it is 
natural to consider using graphene for sensor applications, from mea- 
surements of magnetic field to DNA sequencing and from the monitor- 
ing of the velocity of surrounding liquid to strain gauges. The latter (with 
either electrical or optical readouts) are probably the most competitive 
application. Graphene is the only crystal which can be stretched by 20%,
thus enhancing the working range of such sensors significantly.

Currently, graphene gas detectors, although extremely sensitive, have 
only a minor competitive edge over existing devices. Low selectivity and 
poisoning by water limit their area of applicability, although such detec- 
tors can be produced so cheaply that they could be used in certain niche 
applications. Functionalization might improve the selectivity of graphene 
sensors, but because it is rather an expensive method, it is probably most 
suitable for bio-sensing. 

The major advantage of graphene sensors is their multi-functionality. 
A single device can be used in multidimensional measurements (for 
example, strain, gas environment, pressure and magnetic field). In this 
sense graphene offers unique opportunities. With the development of 
increasingly interactive consumer electronic devices, such sensors will 
certainly find their way into many products. 

The unique bandstructure of graphene, with its anomalously large
energy splitting between the zero-energy and the first Landau levels,
makes it an ideal material to develop the universal resistance standard
based on the quantum Hall effect. The precision of quantum Hall effect
quantization of 0.1 parts per billion for epitaxial graphene grown on the
Si face of SiC by far outperforms that in the traditionally used GaAs
heterostructures, and such devices are already being used by several
metrological facilities.
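
The size of the Landau-level splitting and the quantized resistance values can be checked with a few lines of arithmetic. The sketch below uses the standard expressions E₁ = v_F √(2eħB) for the first Landau level of graphene, ħeB/m* for the cyclotron energy of a conventional two-dimensional electron gas, and R = h/(νe²) for a quantum Hall plateau. The Fermi velocity of 10⁶ m s⁻¹, the GaAs effective mass of 0.067 electron masses, the 10 T field and the function names are assumptions made for illustration.

```python
import math

H = 6.62607015e-34         # Planck constant, J s
HBAR = H / (2.0 * math.pi)
E = 1.602176634e-19        # elementary charge, C
M_E = 9.1093837015e-31     # electron mass, kg


def graphene_first_landau_level_ev(b_tesla: float, v_fermi: float = 1.0e6) -> float:
    """Energy of the first Landau level above the zero-energy level, in eV."""
    return v_fermi * math.sqrt(2.0 * E * HBAR * b_tesla) / E


def cyclotron_energy_ev(b_tesla: float, effective_mass_ratio: float = 0.067) -> float:
    """Landau-level spacing of a parabolic-band 2D electron gas, in eV."""
    return HBAR * E * b_tesla / (effective_mass_ratio * M_E) / E


def hall_resistance_ohm(filling_factor: int) -> float:
    """Quantized Hall resistance h / (nu * e^2)."""
    return H / (filling_factor * E ** 2)


if __name__ == "__main__":
    print(f"graphene, 10 T: {graphene_first_landau_level_ev(10.0) * 1e3:.1f} meV")
    print(f"GaAs-like 2DEG, 10 T: {cyclotron_energy_ev(10.0) * 1e3:.1f} meV")
    print(f"nu = 2 plateau: {hall_resistance_ohm(2):.3f} ohm")
```

With these assumed parameters the graphene splitting is several times larger than the GaAs spacing at the same field, which is the property that relaxes the temperature and field requirements for a resistance standard.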


Bioapplications 

Graphene has a number of properties which make it potentially
promising for bioapplications. Its large surface area, chemical purity
and the possibility of easy functionalization provide opportunities
for drug delivery. Its unique mechanical properties suggest applications
in tissue engineering and regenerative medicine. Its combination
of ultimate thinness, conductivity and strength makes it an ideal support
for imaging biomolecules in transmission electron microscopy. Also,
chemically functionalized graphene might lead to fast and ultrasensitive
measurement devices, capable of detecting a range of biological
molecules including glucose, cholesterol, haemoglobin and DNA.

As a result of their large surface area and delocalized π electrons,
graphene derivatives can solubilize and bind drug molecules and thus
have the potential to be drug delivery vehicles in their own right if
sufficiently high drug loadings and suitable in vivo drug distribution
and release profiles can be achieved. Graphene is also lipophilic, which
might help in solving another challenge in drug delivery: membrane
barrier penetration (Fig. 5). Most of the limited work that has been done
so far has focused on investigating the loading and in vitro behaviour
for aromatic anticancer drugs such as doxorubicin. Intravenous
administration of polyethylene glycol-modified graphene oxide, labelled
with a near-infrared fluorescence dye but not carrying any drug, has
shown passive tumour targeting in mouse xenograft models. The
tumours were killed when irradiated with a low-power near-infrared
laser, showing the potential of using graphene derivatives for
photothermal cancer treatment. However, given the high safety, clinical and
regulatory hurdles and long timescales associated with drug
development, which are exacerbated when new materials are involved, it is
unlikely that products using graphene-based drug delivery technology
will be near the market before 2030.

Figure 5 | Manipulating the hydrophilic-lipophilic properties of graphene
(blue hexagonal planes) through chemical modification would allow
interactions with biological membranes (purple-white double layer), such as
drug delivery into the interior of a cell (blue region).

Tissue engineering is an emerging area of technology with potential 
for a significant impact on patient treatment across a range of disease 
areas, although as yet only a small number of potential products have 
entered clinical trials. Graphene could be incorporated into the scaffold
materials used for tissue engineering to improve their mechanical
(strength and elasticity) and selective barrier properties and potentially
to modulate their biological performance in areas such as cell adhesion,
proliferation and differentiation.

Before graphene can fulfil its promise in the biomedical area we must 
understand its biodistribution, biocompatibility and acute and chronic 
toxicity under conditions that are relevant to exposure during manufac- 
ture and subsequent use. Ultimately, this will probably need to be done 
for the particular form of graphene being used in a given application 
because the outcome is likely to vary with size, morphology and 
chemical structure. In some cases it may also be possible to exploit the 
biological activity that gives rise to a particular toxicity profile. For 
example, a ‘toxic’ graphene derivative could potentially be therapeutic 
in its own right as an antibiotic or anticancer treatment. 


Conclusions 


Physicists are used to thinking of graphene as a perfect two-dimensional 
lattice of carbon atoms. However, the paradigm is now shifting as pure 
science opens new technology routes: even less-than-perfect layers of 
graphene can be used in certain applications. In fact, different applica- 
tions require different grades of graphene, bringing closer widespread 
practical implementation of this material. 

As the current market for graphene applications is driven by the 
production of this material, there is a clear hierarchy in how soon the 
applications will reach the user or consumer. Those that use the lowest- 
grade, cheapest and most available material will be the first to appear, 
probably in a few years, and those which require the highest, electronic- 
quality grades or biocompatibility may well take decades to develop. 
Also, because developments in the last few years were extremely rapid,
graphene’s prospects continue to improve. Nevertheless, established
benchmark materials will only be replaced if the properties of graphene, 
however appealing, can be translated into applications that are suffi- 
ciently competitive to justify the cost and disruption of changing exist- 
ing industrial processes. 

Graphene is a unique crystal in the sense that it combines many 
superior properties, from mechanical to electronic. This suggests that 
its full power will only be realized in novel applications, which are 
designed specifically with this material in mind, rather than when it is 
used to replace other materials in existing applications. Interestingly, 
such an opportunity is likely to be provided very soon with development 
of such new technologies as printable and flexible electronics, flexible 
solar cells and supercapacitors. 
Molecular machines governing
exocytosis of synaptic vesicles


Reinhard Jahn¹ & Dirk Fasshauer²


Calcium-dependent exocytosis of synaptic vesicles mediates the release of neurotransmitters. Important proteins in this
process have been identified such as the SNAREs, synaptotagmins, complexins, Munc18 and Munc13. Structural and
functional studies have yielded a wealth of information about the physiological role of these proteins. However, it has
been surprisingly difficult to arrive at a unified picture of the molecular sequence of events from vesicle docking to
calcium-triggered membrane fusion. Using mainly a biochemical and biophysical perspective, we briefly survey the
molecular mechanisms in an attempt to functionally integrate the key proteins into the emerging picture of the neuronal
fusion machine.


Exocytosis and recycling of synaptic vesicles define how much
transmitter is released from nerve terminals during incoming
action potentials (Fig. 1). Under resting conditions, synaptic
vesicles are stored in the cytoplasm of the nerve terminal, with some of
them attached to specialized sites at the presynaptic plasma membrane
termed active zones. Active zones are composed of unique multidomain
proteins that provide a scaffold for vesicle docking and participate in
activating the release apparatus, referred to as priming. Priming probably
involves several reactions, including some requiring metabolic energy.
Docked and primed vesicles (termed the readily releasable pool) are ready to
go, and some do so spontaneously, with the transmitter released by a
single vesicle giving rise to a miniature postsynaptic potential. When an
action potential arrives, voltage-gated calcium channels open, with the
resulting calcium influx stimulating the rate of exocytosis more than
100,000-fold in a highly cooperative manner (for review see ref. 1).

During the past two decades, the key proteins mediating neuronal
exocytosis have been identified. Many of them belong to structurally
conserved protein families including the SNAREs, Rab proteins, Sec1/
Munc18-like (SM) proteins, and a group of tethering proteins termed
CATCHR (complex associated with tethering containing helical rods)
proteins. Apparently, they form the core of an ancient intracellular
fusion machine that diversified during evolution to adapt to the needs
of specialized compartments. Neuronal exocytosis constitutes one such
adaptation, and specific regulatory proteins such as synaptotagmins and
complexins evolved in the animal kingdom (for reviews see refs 2-8).

Despite such progress, there is still a gap in understanding between
the functional properties of synaptic exocytosis and the molecular
features of the key proteins. Modern electrophysiological and
imaging approaches provided a wealth of information about the
number of docked and primed vesicles, the exchange rates of vesicles
between different pools, their release probabilities, their kinetics of
exocytosis, and the dependence of exocytosis on calcium. Thus, detailed
job descriptions for the underlying molecular machines are available.
However, whereas genetic perturbations were instrumental in defining
the basic functions of the key proteins, it often proved difficult to assign
them to a specific step in the exocytotic pathway. For instance, Munc18,
synaptotagmin and even the SNAREs were shown to function in
docking as well as in priming and triggering. Conversely, specific steps such
as docking are controlled by multiple proteins (see refs 7 and 11 for a
more detailed discussion). It also often proved difficult to reconcile the
physiological effects of the perturbations with the physicochemical
properties of the proteins. Thus, the molecular mechanisms responsible
for the attachment of synaptic vesicles to the active zone, for the
activation of the release machinery, and for calcium triggering of exocytosis on
a millisecond timescale are only slowly emerging.


SNARE proteins, the engine of membrane fusion 


The synaptic proteins synaptobrevin (also referred to as VAMP),
syntaxin 1 and SNAP-25 belong to the SNARE protein family. Their
defining feature is an extended coiled-coil stretch, which is referred to as
a SNARE motif and falls into four subtypes, referred to as Qa, Qb, Qc
and R-SNARE motifs (for example ref. 14). In syntaxin, synaptobrevin
and in most other SNAREs the SNARE motifs are connected by a short
linker to a carboxy-terminal transmembrane region (TMR). SNAP-25
deviates from this general structure: here two SNARE motifs (Qb and
Qc) are connected by a linker that is palmitoylated, whereas a TMR is
lacking. Whereas synaptobrevin and SNAP-25 do not carry any other
domains, syntaxin possesses an amino-terminal domain consisting of an
antiparallel three-helix bundle, termed the Habc domain, connected to
the SNARE motif by a flexible linker. Positioned N-terminally to the Habc
domain is a short stretch that ends in the so-called N-peptide (see Fig. 2).

Figure 1 | Trafficking pathways in the nerve terminal. Synaptic vesicles are
filled with neurotransmitter and stored in the cytoplasm. Active vesicles are
translocated to release sites in the active zone where they dock. Priming
involves all steps required to acquire release readiness of the exocytotic
complex. Although usually assumed to occur after docking, priming and even
triggering may precede docking during sustained activity, resulting in
immediate fusion of an arriving vesicle. After exocytosis, the vesicle proteins
probably remain clustered and are then retrieved by endocytosis. Despite some
lingering controversies, consensus is emerging that retrieval generally
occurs by clathrin-mediated endocytosis. After clathrin uncoating, synaptic
vesicles are regenerated within the nerve terminal, probably involving passage
through an endosomal intermediate. Actively recycling vesicles are in slow
exchange with the reserve pool. See text for more details.

SNAREs undergo a regulated assembly-disassembly cycle that is
energized by the AAA+-ATPase NSF. Synaptobrevin is a synaptic
vesicle protein, whereas syntaxin 1 and SNAP-25 are localized in the
presynaptic plasma membrane. On contact, the SNAREs associate in
trans at the N-terminal ends of the SNARE motifs. A tight bundle of four
parallel α-helices is formed, each contributed by a different SNARE
motif, which progresses towards the C-terminal membrane anchors
(‘zippering’), thus pulling the membranes tightly together. Assembly is
associated with a huge release of energy that is used to initiate membrane
fusion. After fusion, the ternary SNARE complex resides in the plasma
membrane in the low-energy cis configuration and is disassembled by
NSF in conjunction with its SNAP cofactor. Next, synaptobrevin is
endocytosed and recycled, thus being able to participate in another
round of exocytosis (for reviews see refs 2-8).

Despite the elegant simplicity and experimental support of the
zippering model, SNARE assembly proved to be an unexpectedly 
complex reaction, and there is still a lot to learn. In vitro, isolated 
SNARE motifs are unfolded but assemble into diverse homo- and 
hetero-oligomers that all are at least partially helical (reviewed in refs 
4 and 8). For instance, SNAP-25 can bind sequentially two syntaxin 
molecules, thus blocking the binding site of synaptobrevin.
Furthermore, syntaxin rapidly switches between an active open con- 
formation and an inactive closed conformation in which the Habc 
domain folds against the N-terminal part of the SNARE motif.
Such conformational dynamics and kinetic trapping of off-pathway
intermediates explain why in vitro assembly of the ternary complex,
although highly exergonic, lasts hours, far too slow to mediate fast 
exocytosis. On the other hand, if a complex of SNAP-25 and syntaxin 
Figure 2 | Schematic depictions of domain structures and crystal structures
of core proteins of the neuronal fusion machine. The dashed lines between 
the N-peptide (N) and Habc domain represent flexible regions in syntaxin. For 
synaptotagmin the two Ca2+-binding sites are indicated. Note that the domain
structure of the large multi-domain protein Munc13 is shown five times smaller 
than those of the other domain structures. A high-resolution structure was 
obtained for the C-terminal half of the MUN domain. See text for details. The 
data for structures are from: Munc13-1 (C and D subdomains), synaptotagmin
1 (C2A and C2B domain), Munc18-1 (blue-green, in complex with syntaxin
(red)), Habc domain, SNARE complex, complexin. aa, amino acid.
with a free N-terminal binding site for synaptobrevin is stabilized, SNARE
assembly is accelerated by orders of magnitude. The central problem is to
delineate precisely the assembly pathway and to understand how the 
SNARE molecules are channelled along this pathway by regulatory 
proteins to execute fusion efficiently. Four proteins, each representing a 
small protein family, have emerged as such key regulators: Munc18 and
Munc13 that prepare the SNARE engine for assembly, and synaptotagmin
and complexin that govern calcium-dependent triggering. 


Priming the SNARE engine 


UNC-18 and UNC-13, the Caenorhabditis elegans orthologues of 
Munc18 and Munc13, respectively, were originally identified by S.
Brenner in his classical screen uncovering genes involved in movement.
Deletion of either Munc18 (ref. 28) or Munc13 (ref. 29) and
their respective orthologues completely inhibits neuronal exocytosis.
Munc18 belongs to the conserved family of SM proteins. It possesses an
arch-shaped architecture with a central cavity for high-affinity binding
to syntaxin-1 (refs 32, 33). By contrast, the large Munc13s belong to the
CATCHR protein family. Munc13 also binds to syntaxin-1 but only
with moderate affinity. Both proteins are involved in setting up the
SNAREs for assembly and perhaps in guiding them through the initial 
part of the assembly pathway, but it is still not understood how exactly 
they operate, how many copies are required to carry out the reaction, 
and how the extraordinary phenotypes of the knockouts can be mechan- 
istically explained. 


Munc18

For many years, the molecular mechanism of Munc18 has been
shrouded by a paradox because it locks syntaxin-1 in a closed conformation
(Fig. 2), in which syntaxin cannot enter SNARE complexes.
Such inhibition is difficult to reconcile with the complete loss of 
exocytosis in deletion mutants, which suggests exactly the opposite, 
namely that SNARE zippering is absolutely dependent on Munc18.
Indeed, Munc18 seems to be an oddity because other SM proteins,
despite high structural similarity, bind instead tightly to the N-peptide 
of their cognate syntaxins, involving a binding site on the surface of the 
SM protein. This binding mode would enable these syntaxins to remain 
open, with SNARE assembly not being inhibited, whereas syntaxin-1 
would need to be opened in the case of Munc18. To reconcile these 
discrepancies, it was proposed that binding of SM proteins to syntaxins, 
whether via the N-peptide or the Habc domain, merely serves to recruit 
the SM protein to the prospective fusion site. The SM proteins are 
then handed over to the SNARE motifs where they promote nucleation 
and/or zippering (for reviews see refs 3 and 37). 

Recently, it has been recognized that SM proteins, including Munc18, 
generally bind to their respective syntaxins using both of the spatially 
distinct binding sites, but with different relative affinities. In fact,
the two binding sites seem to act together in controlling SNARE complex
formation. This sheds new light on the paradox, as full ‘opening’
of syntaxin may not be required for gating entry into SNARE complexes. 
In support of this view, a syntaxin mutant originally thought to be 
constitutively open (LE mutant) is now known to bind Munc18 via
both sites in an at least partially closed conformation, but without
inhibiting formation of SNARE complexes. Indeed, when expressed
as the only syntaxin 1 variant, the LE mutant results in enhanced
spontaneous exocytosis, supporting the idea that under resting conditions it is
more reactive with respect to SNARE binding.

Thus it seems that binding of Munc18-1 to both the closed conformation
and to the N-peptide of syntaxin 1a is an integral part of the
pathway during which Munc18 guides syntaxin towards productive
SNARE complex formation. Perhaps Munc18 first keeps syntaxin closed
and inactive, thus preventing premature SNARE assembly, but allows 
for synchronization of a subsequent (calcium-dependent?) activation 
step (see for example ref. 3). 

Despite such progress, it is still unclear why Munc18 is essential for
efficient SNARE nucleation. Reconstitution experiments involving 
liposomes suggest that Munc18 participates in selecting the correct
R-SNARE helix and thus guides nucleation of the ternary complex 
(for example refs 41 and 42). It is unclear whether it then remains 
associated with the ternary complex as also suggested for other SM
proteins (for example ref. 37) or dissociates upon zippering. In the
latter case Munc18 might be interacting with a trans-SNARE complex
only during its initiation, whereas progression of zippering would cause
syntaxin to fully open, thus driving off Munc18.


Munc13

Munc13s are modular proteins sharing a conserved C-terminal region 
containing a phorbol-ester-binding C1 domain and two calcium-binding
C2 domains that flank a larger, so-called MUN domain
(Fig. 2, reviewed in ref. 45). Expression of the MUN domain alone 
partially rescues the total arrest of exocytosis in neurons lacking 
Munc13s, identifying it as a key functional element of the protein.
MUN domains are shared with the proteins BAP3 and CAPS (Unc31)
and with other proteins in most eukaryotes. The MUN domain is
structurally strikingly similar to other CATCHR family members that
work in various trafficking steps. These proteins form elongated arrays
of stacked α-helical bundles with flexible hinge regions, which tether
transport vesicles to the site of fusion. It is conceivable that the conserved
MUN domain serves as a binding platform that arranges the core
fusion machinery, whereas the C1 and C2 domains mediate fine-tuning
of its membrane recruitment, a feature ideally suited to Ca2+-regulated
secretion. 

CATCHR complexes are also thought to enable SNARE assembly, 
although their interplay with SNARE proteins seems to vary. For 
instance, Munc13 may participate directly by unlocking syntaxin from
the grip of Munc18, because in C. elegans, the LE mutant of syntaxin 
(that is not inhibited by Munc18 binding) partially rescues neurotrans- 
mitter release in the absence of UNC-13 (refs 49 and 50). Furthermore, 
recent experiments have shown that the isolated MUN domain accelerates
the transition of syntaxin-1 from the Munc18-1 complex to the
SNARE complex. It should be kept in mind, however, that the LE
mutant also partially rescues the block of exocytosis caused by deletion
of RIM (also known as UNC-10) in C. elegans. RIM serves as a central
organizer of the active zone. It forms a tripartite complex with the 
N-terminal C2A domain of Munc13 and the small vesicular GTPases 
RAB3 and RAB27, thus orchestrating the attachment site of synaptic 
vesicles (reviewed in ref. 2). 


Ca2+-dependent triggering starts the SNARE engine


In contrast to the basic fusion reaction that is carried out by conserved 
proteins traced back to an ancient eukaryotic machine, the unique 
features of calcium-triggered exocytosis are primarily encoded in 
specialized proteins. Of these, synaptotagmins I, II and IX constitute 
the dominant calcium sensors whose deletion results in a complete loss 
of fast, calcium-triggered exocytosis (reviewed in refs 52-54). However, 
asynchronous (that is, slower) calcium-dependent release persists, 
showing that other calcium-binding proteins are involved, with candi- 
dates including other synaptotagmin isoforms or related proteins such 
as Doc2 (refs 55-57). Furthermore, complexins I and II are involved 
in triggering: deletion of complexins strongly reduces calcium-evoked 
exocytosis, whereas both stimulatory and inhibitory effects were 
observed on spontaneous release (for example see refs 58 and 59). 


Synaptotagmins 

The neuronal synaptotagmins are anchored to synaptic vesicles by a 
single TMR. Characteristic features of the synaptotagmins are two C2 
domains, called C2A and C2B, that are connected to the membrane as 
well as to each other by flexible linkers. C2 domains are rigid, oval-shaped
β-sandwiches that possess a cluster of calcium-binding loops,
serving as partial coordination sites for two (C2A) or three (C2B) calcium
ions. In the presence of calcium, the C2 domains bind to membranes 
containing acidic phospholipids that complete the calcium coordination 
sites. In addition, the C2B domain contains a spatially separated basic
patch that steers the domain to membranes enriched in phosphatidyli- 
nositol (4,5) bisphosphate (PI(4,5)P2). Membrane binding is primarily 
electrostatic and rapidly reversed by chelating calcium or increasing the 
ionic strength. Furthermore, the synaptotagmin C2 domains bind to 
syntaxin alone or syntaxin-containing SNARE complexes (for example 
see refs 60 and 61). Although binding occurs in the absence of calcium, it 
appears to be influenced by calcium (reviewed in refs 54 and 62). 


Complexins 

Complexins are small cytoplasmic proteins that bind via a central helix 
to a groove on the surface of the SNARE complex, which is formed by 
the helices of syntaxin and synaptobrevin (Fig. 2). Because SNARE
binding is required for their physiological action, complexins can only 
exert their function once SNAREs are at least partially assembled, 
placing them into the reaction sequence after zippering is initiated. 
Intriguingly, the central helix is not sufficient for complexin function. 
Rather, the N-terminal end pointing towards the membrane is needed 
for facilitation of fusion, whereas the regions flanking the central helix 
seem to have an inhibitory role. To accommodate the presumed dual 
stimulatory and inhibitory role of complexin, two alternative molecular 
mechanisms are discussed. First, binding to the surface of the
SNARE complex may promote initiation and progression of zippering, 
for example, by stabilizing partially zippered SNARE complexes and 
sensitizing them to activation by synaptotagmin (‘super-priming’) (for 
example see ref. 58). Second, complexin acts as a clamp that blocks 
progression of SNARE-zippering, presumably by competing directly 
with synaptobrevin binding in the C-terminal part of the SNARE com- 
plex (for example see refs 59 and 68). The clamp is released upon 
calcium triggering, probably by synaptotagmin (see below) because 
complexins do not bind calcium. 


Two models explain the action of calcium 

Despite many years of research, it is still controversial as to how calcium 
influx brings about the extraordinary and highly cooperative accelera- 
tion of exocytosis. To some extent this is owing to the fact that the 
molecular status of a docked and primed vesicle, ready to respond to 
calcium by exocytosis in less than a millisecond, is not known with 
certainty. 

Most authors seem to agree that SNAREs are already partially zippered 
in this state, with full zippering being prevented either by an energy 
barrier in the fusion pathway that the SNAREs alone cannot overcome 
(for example, electrostatic repulsion, transition towards a stalk inter- 
mediate, see below), and/or by an interfering protein, with prime candi- 
dates being complexins and/or synaptotagmins (Fig. 3, pathway I). 
During this state, Munc18, and perhaps also Munc13, may still be bound 
to the complex. It is debated whether such a complex is strained, that is, 
storing energy that is released during fusion, or whether it is relaxed, with 
the linkers connecting the zippered part of the complex to the membrane 
being flexible. 

Calcium binding to synaptotagmin would trigger fusion either by 
activating (disinhibiting) the SNAREs or by lowering the activation 
energy barrier in the fusion pathway through membrane interactions. 
Accordingly, synaptotagmin may act by (1) disengaging from the 
SNAREs, thus relieving the block (fusion clamp model), (2) binding
to the SNAREs, thus displacing the inhibitory complexin and/or promoting
zippering, (3) binding to the membrane directly adjacent to the
partially complexed SNAREs, thus destabilizing the bilayer at the fusion
site, (4) increasing curvature stress by displacing lipids in the
monolayer of the plasma membrane facing the vesicle, and (5)
cross-linking the vesicle and the plasma membrane, thus accelerating
fusion by charge compensation owing to the positive electrostatic potential
of the C2 domains.

A wealth of evidence is invoked in support of a partially zippered and 
arrested SNARE complex; for instance, differential effects of SNARE 
mutations on fusion kinetics that affect nucleation and zippering, 
Figure 3 | Alternative models describing the steps between priming and
fusion. Priming I involves arrest of a partially zippered SNARE complex, here
shown with bound Munc18, Munc13 and synaptotagmin. Calcium influx
triggers binding of synaptotagmin to the SNARE complex and to the plasma
membrane (involving PI(4,5)P2, not shown here), associated with displacement
of complexin and (possibly) Munc18 and/or Munc13. Priming II involves
arrest after positioning of the vesicle with the aid of active zone components and
(possibly) contact of synaptotagmin with PI(4,5)P2 in the plasma membrane,
but no contact between the SNAREs. Ca2+-triggering pulls the vesicle closer via
synaptotagmin-mediated cross-linking, resulting in SNARE assembly, 
associated with full opening of syntaxin and displacement of Munc18, and 
binding of complexin. See text for details. 


respectively (see for example refs 76-78). The model also allows for an 
integration of complexin into the fusion mechanism that needs at least 
partial SNARE assembly before it can bind and exert its action. 
Furthermore, the model intuitively explains the fast fusion kinetics
because only minor conformational rearrangements are required upon
Ca2+-triggering, with all proteins already being correctly positioned for
the final step. 

On the other hand, there are problems with this model, which in our 
opinion have not been sufficiently appreciated. Experimentally, trans- 
complexes are difficult to capture. Similarly, despite hints, for example, 
from single-molecule experiments, an effect of synaptotagmin and/or
complexin on the rate of SNARE assembly has remained elusive. Most 
importantly, the mechanisms proposed for arresting SNARE zippering 
somewhere in the middle are difficult to reconcile with the fact that 
SNARE assembly proceeds along a steep downhill energy gradient. 
For instance, a C-terminal fragment of synaptobrevin forms in vitro a 
stable complex with SNAP-25 and syntaxin, thus blocking, like a brake 
shoe, the C-terminal portion from assembling as envisioned in the 
partially zippered model. However, full-length synaptobrevin is able 
to rapidly displace this fragment and, despite this additional energy
barrier, to promote fusion in vitro even if only one of such partially
inhibited SNARE complexes is involved. None of the proposed factors
(including complexin) binds with an affinity even remotely comparable 
to that of the synaptobrevin fragment, questioning their ability to 
influence the strongly exergonic zippering reaction. Furthermore, we 
consider it unlikely that the control of the neuronal SNAREs is exerted 
by tinkering with its structurally highly conserved engine core, that is, 
the helical bundle whose major features (structure, stability, folding-unfolding
hysteresis) are remarkably similar between SNAREs in regulated
and non-regulated trafficking steps.

Similarly, a role of synaptotagmin in destabilizing membranes or 
inducing curvature stress is difficult to reconcile with Ca2+-dependent
membrane binding being primarily electrostatic and reversible. Indeed, 
vesicle deformation in vitro requires saturation with C2 domain-containing
synaptotagmin fragments, whereas only a few phospholipid
molecules are expected to be displaced by binding of single C2 domains, 
hardly sufficient to create even local tension. Generally, agents increasing 
positive spontaneous curvature of the proximal monolayers inhibit 
rather than enhance fusion, that is, exactly the opposite of what
synaptotagmin is doing, requiring elaborate models of highly organized
‘bulges’ to explain promotion of fusion intermediates.

Recently, an alternative scenario for the docked and primed state has 
been envisaged that, despite being far from proven, we consider an
interesting alternative, as it overcomes several of the problems outlined
above (Fig. 3, pathway II, see also ref. 40). Supported by recent
electron tomography data showing that docked vesicles appear to be a
few nanometres away from the plasma membrane, it assumes that
SNAREs do not connect in trans before the arrival of the calcium signal. 
Rather, the interaction of the vesicle with the active zone components 
(most notably Munc13 and RIM) would precisely position the vesicle on
top of a patch of plasma membrane containing activated SNARE 
acceptor complexes, probably complexed with Munc18. In this state, 
vesicle-bound synaptotagmin may be already in contact with the plasma 
membrane, either by (calcium-independent) binding to the SNAREs or 
by loosely binding to PI(4,5)P2 patches colocalizing with syntaxin clus- 
ters. Calcium influx would trigger membrane binding and cross-linking 
of the vesicle and plasma membrane, thus nudging them a bit closer,
sufficient to allow for rapid binding of synaptobrevin to the
acceptor complex. Once nucleation is triggered, the SNAREs quickly
progress through zippering and fusion. 

This model places the entire control of the neuronal fusion machine 
upstream of SNARE nucleation, which has important consequences for 
our understanding of the partial reactions. Most importantly, it changes 
the view of SNARE function. Accordingly, SNAREs act as ‘single shot’ 
devices that, once nucleation is triggered, are unstoppable and flash 
through assembly to bring about fusion. ‘Misfiring’ of SNAREs (assembly 
without fusion) probably only occurs rarely, if at all, but is likely to 
increase in mutants affecting zippering. Also, it is possible that
nucleation triggers the displacement of Munc18 and other factors (such 
as Munc13), thus allowing the SNAREs to carry out the work unhindered
by bulky bound proteins. Such a simple and highly efficient mode of 
operation may explain why the SNARE engine was so successful in 
evolution. The proposed function for synaptotagmin is in line with the 
function of C2 domains in other proteins such as protein kinase C: they
operate as electrostatic switches mediating calcium-dependent rapid
and reversible membrane binding. 

The model also elegantly explains why solutions of high osmolarity 
(usually sucrose) trigger calcium-independent exocytosis of the readily 
releasable pool: the resulting water efflux creates negative pressure that
draws docked vesicles closer to the plasma membrane, triggering SNARE 
firing. Furthermore, any destabilization of the overall architecture of the 
docking site, which increases Brownian fluctuations of the vesicle, would 
cause occasional spontaneous firing of SNAREs, which may explain 
changes in spontaneous release rates upon deletion or overexpression 
of some proteins (for a detailed discussion see for example refs 9 and 57). 

Finally, the model provides for a fresh look at the molecular basis of 
the high cooperativity of calcium-triggered fusion. At non-saturating 
calcium concentrations, synaptotagmin binding may be less tight or 
transient, perhaps undergoing rapid and repetitive ‘on-off’ cycles, resulting
in vesicle jittering. Accordingly, the probability for SNARE
nucleation/firing would be reduced. Such a scenario may also explain 
the function of complexin, which is otherwise more difficult to integrate 
into this model. Complexin may increase the frequency of successful 
nucleation events by stabilizing correctly oriented syntaxin-synaptobrevin 
alignments. This hypothesis is in line with the ability of complexin to bind 
to SNARE complexes with fast, diffusion-limited kinetics.
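
As a purely illustrative aside, the steepness implied by such a scenario can be made tangible with a toy calculation. The sketch below is not taken from the studies cited here: it assumes that the five calcium sites of synaptotagmin (two in C2A, three in C2B) bind independently with one identical, hypothetical dissociation constant, and that productive membrane binding requires all of them to be occupied at once. The function names and the parameter kd are invented for the example.

```python
# Toy model (assumption, not from the review): n independent, identical
# calcium sites must all be occupied for productive membrane binding.

def site_occupancy(ca, kd):
    """Fraction of time a single site is occupied (simple 1:1 binding)."""
    return ca / (ca + kd)

def all_sites_occupied(ca, kd, n_sites=5):
    """Probability that all n independent sites are occupied together."""
    return site_occupancy(ca, kd) ** n_sites

if __name__ == "__main__":
    kd = 1.0  # hypothetical dissociation constant, arbitrary units
    for ca in (0.01, 0.1, 1.0, 10.0):
        print(f"[Ca] = {ca:>5} x Kd -> P(all bound) = {all_sites_occupied(ca, kd):.3g}")
```

In this toy picture, doubling a low calcium concentration raises the probability of full occupancy by nearly 2^5, which is one simple way in which transient, incomplete binding at non-saturating calcium could translate into a strongly nonlinear release probability. Real synaptotagmin sites differ in affinity and are completed by phospholipids, so the numbers carry no quantitative meaning.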


Fusion—interplay between proteins and lipids 


In the final step of exocytosis the vesicle membrane fuses with the 
plasma membrane. The merger of two bilayers involves non-bilayer 
intermediates at the contact site that ultimately develop into the opening 
of an aqueous channel, termed a fusion pore. During fusion the hydro- 
phobic barrier separating the cytoplasm from both the vesicle content 
and the extracellular space must remain intact. 

Key issues concerning the molecular rearrangements of proteins and 
membrane lipids along the fusion pathway are unresolved. Popular 
models requiring an oligomeric ring of SNARE complexes surrounding 
the prospective fusion pore as intermediate cannot be maintained in 
view of the fact that only one to two (or three) SNARE complexes are 
sufficient for fusion both in vitro and in vivo. Intriguingly, in vitro
fusion can be mediated by trans assembly of artificially engineered 
molecules mimicking SNARE-zippering (even DNA) as long as they 
possess membrane anchors (for example see refs 91-94). Such a lack 
of structural specificity in catalysis is indeed a hallmark of membrane 
fusion, and it is likely that considerable structural variety is tolerated 
along the fusion pathway. This helps to explain why unrelated classes of 
fusion proteins evolved in parallel to the SNAREs, such as those fusing 
cells, viruses, mitochondria or the endoplasmic reticulum.

The stalk hypothesis, first developed 30 years ago, describes membrane
fusion as an ordered sequence of steps initiated by an hourglass-shaped
intermediate (the fusion stalk), followed by a hemifusion
diaphragm and subsequent rupture, resulting in the formation of a 
fusion pore (Fig. 4). Indeed, stalk-like intermediates can be induced as 
a separate phase under mild conditions. However, the energy landscape
as well as the intermediate molecular structures along the fusion
pathway is unclear. 

Originally, the energy profile was modelled on the basis of the elastic 
properties of membranes, with the curvature stress of the intermediate 
model structures defining transition-state energies. However, these 
energies were unrealistically high, and molecular parameters were 
invoked to lower the energies (for review see ref. 103). More recently, 
coarse-grain or even atomistic simulations of fusion have provided 
detailed scenarios for intermediate structures (Fig. 4), with con- 
sequences for the energy landscape. For instance, it has been suggested 
that ‘splaying’ of phospholipid tails may form the first hydrophobic 


Figure 4 | Transition states during membrane fusion. Intermediates of the
fusion pathway (labels in the drawings: proximal monolayer, distal monolayer,
stalk, hemifusion, fusion pore). The top drawings represent intermediate states of the
membrane along the pathway as predicted by the elastic theory. Below,
snapshots of intermediate states of a simulation of SNARE-mediated fusion are
shown, which, although roughly corresponding to the elastic model, differ in
detail and in their energy predictions (adapted from ref. 109, courtesy of
J. Risselada and H. Grubmüller).


connection between the membranes, from where stalk formation
proceeds down an energy gradient. Furthermore, the enhanced
fusogenicity of curved membranes can also be explained by the 
hydrophobic effect: owing to the increased spacing of the hydrophilic 
head groups the membrane surface is more hydrophobic. Lipid splaying 
requires the membranes to be at a critical distance of less than 1 nm
(Fig. 4, see ref. 105 for a more detailed discussion). 

These considerations have important consequences for the mech- 
anism of SNARE-catalysed fusion. Certainly, zippering of the four-helix 
bundle brings the membranes in close proximity, but the question is 
how the SNAREs promote stalk formation and subsequent intermediate 
structures. If the main energy barrier is contributed by curvature 
stress, stiffness of the linkers connecting SNARE motifs and TMRs
is essential for transmitting stress to the membranes. Indeed, mutagenesis 
of the linkers generally reduces fusion efficacy (see for example ref. 107), 
and at least syntaxin seems to have a stiff linker as a monomer. On the
other hand, if close proximity, water removal, increase of local hydro- 
phobicity and lipid splaying form the main energy barrier, bending stiff- 
ness of the SNARE linkers may not be as relevant, a view suggested by 
recent simulation studies. Instead, the pulling force exerted during
zippering may drag the TMRs along with some phospholipids slightly 
out of the membrane, thus initiating phospholipid splaying once the 
critical distance has been reached. 

What are the next steps? Transient hemifusion intermediates (experi- 
mentally defined, for example, by lipid mixing in the absence of content 
mixing) are observed upon SNARE-mediated fusion of liposomes, 
suggesting hemifusion as a metastable intermediate (for example see 
refs 110-114). However, it is experimentally difficult to differentiate 
between stalk and hemifusion intermediates. Hemifusion constitutes 
the lateral expansion of a stalk, leading to the formation of a hemifusion
diaphragm (Fig. 4). It remains to be seen whether such diaphragms 
represent intermediates along the fusion pathway or whether they are 
dead-ends as previously suggested for viral fusion proteins (see ref. 115 
for a review). In any case, the job of the SNAREs is not finished before
the initial opening of the fusion pore, with interactions between the 
linkers as well as the TMRs probably being involved.


Conclusion 


The molecular basis of synaptic exocytosis has fascinated scientists for 
decades. Since the initial discovery of quantal release in the 1950s by 
Katz and colleagues, and the elucidation of the synaptic vesicle-recycling 
pathway by Heuser and Ceccarelli in the 1970s, we have come a long way 
in deciphering the steps of the vesicle cycle at an increasingly detailed 
level. Although we have focused here on only a few key components, the 
vesicle cycle is governed by hundreds of proteins, and there are still new 
proteins being put on the map. We are only beginning to understand the 
rules by which individual protein-protein interactions work together in 
supramolecular machines to yield the synaptic vesicle cycle that reliably 
operates millions of times. These machines assemble on demand and 
disassemble when the task is completed. They are highly robust, tolerate 
varying stoichiometries, flexible compositions and other disturbances, 
and are controlled by an array of regulators such as protein kinases 
and phosphatases. Advances in technologies such as super-resolution 
microscopy, single-molecule measurements, fluorescent reporters 
and cryo-electron tomography are all contributing to closing the gap 
between our understanding of partial reactions in vitro and the 
fascinating efficiency of the vesicle cycle in intact synapses. 
The hexadehydro-Diels-Alder reaction


Thomas R. Hoye, Beeraiah Baire, Dawen Niu, Patrick H. Willoughby & Brian P. Woods


Arynes (aromatic systems containing, formally, a carbon-carbon triple bond) are among the most versatile of all 
reactive intermediates in organic chemistry. They can be ‘trapped’ to give products that are used as pharmaceuticals, 
agrochemicals, dyes, polymers and other fine chemicals. Here we explore a strategy that unites the de novo generation of 
benzynes, through a hexadehydro-Diels-Alder reaction, with their in situ elaboration into structurally complex
benzenoid products. In the hexadehydro-Diels-Alder reaction, a 1,3-diyne is engaged in a [4+2] cycloisomerization 
with a ‘diynophile’ to produce the highly reactive benzyne intermediate. The reaction conditions for this simple, thermal 
transformation are notable for being free of metals and reagents. The subsequent and highly efficient trapping reactions 
increase the power of the overall process. Finally, we provide examples of how this de novo benzyne generation approach
allows new modes of intrinsic reactivity to be revealed.


ortho-Benzyne (1,2-didehydrobenzene, 1, Fig. 1a) is one of the oldest,
most interesting, most useful and most well-studied of all reactive 
intermediates in chemistry. The multifaceted and often remarkably 
efficient reactions of benzynes with suitable trapping reagents (1 to 3
in Fig. 1a) have long been employed in the service of synthetic chemistry. 
Even by 1967 many such reactions were known. Nonetheless, newly
discovered benzyne reaction motifs continue to emerge, especially so
within the last decade. This renaissance attests to the additional versatility
and heralds even greater potential of this intermediate. Currently,
all practical methods for producing benzynes involve the removal of two 
adjacent atoms or substituents (‘G’ and ‘X’; Fig. 1a) from benzenoid
precursors (2). These protocols typically require the use of a strong base 
and/or access to a 1,2-disubstituted arene substrate. A complementary 
general method for benzyne/aryne generation would considerably 
expand the preparative utility of these remarkable intermediates. 


The venerable Diels-Alder cycloaddition reaction is highly regarded
in synthetic chemistry. The prototypical event (Fig. 1b), found in
every introductory organic chemistry textbook, is the combination
of 1,3-butadiene (5) as the 4π-component with ethylene (4) as the
dienophile to give cyclohexene (6), a product in the tetrahydrobenzene
oxidation state. If, instead, an alkyne like ethyne (7) is the dienophile, a 
1,4-cyclohexadiene (here 1,4-dihydrobenzene (8)) results (Fig. 1c); we 
suggest this be viewed as a didehydro-Diels—Alder reaction. Another 
known variant (Fig. 1d) involves engagement of a (yet more highly 
oxidized) 1,3-enyne (for example, 9) as the 4m-component with an 
alkyne (for example, 7). It is interesting to note that the first example 
of this process (the thermal dimerization of phenylpropiolic acid)"* was 
described 30 years before the initial report of Diels and Alder’’. The 
intermediate cyclic allene 10 rapidly rearranges via a [1,5] hydrogen 
atom shift to benzene (11). This process has until now been called, 
generically, the dehydro-Diels-Alder (DDA) reaction. In the light of 
the generality of the results we present here, we suggest that the 
transformation in Fig. 1d would be better named the tetradehydro-Diels-Alder 
(TDDA) reaction. 

Figure 1 | Diels-Alder reactions of varying oxidation states. a, Generic 
benzyne generation (2 to 1) and trapping (1 to 3). Most commonly, 
G/X = H/halogen, CO2−/N2+, halogen/halogen, or TMS/OTf. b-e, Prototypes 
of Diels-Alder [4+2] reactions (that is, the combination of a four-atom (or 4π-electron) 
component with a two-atom (or 2π-electron) component to produce a 
six-membered ring) differing in the oxidation levels of the reactant pairs 
and products: classic Diels-Alder (b), didehydro-Diels-Alder 
(c), tetradehydro-Diels-Alder (TDDA) (d), and hexadehydro-Diels-Alder 
(HDDA, this work) (e) reactions. The aromatic character of benzyne is 
emphasized by the resonance contributors 1 and 1′ as well as by the resonance 
hybrid structure 1*. TMS, trimethylsilyl; OTf, trifluoromethanesulphonate; 
Nu, nucleophile; El, electrophile. 

The most highly oxidized Diels-Alder variant (Fig. 1e) is the 
cycloaddition between a 1,3-diyne like 12 and an alkyne diynophile like 7, 
which generates o-benzyne (compare 1′, 1 and 1*). This 
hexadehydro-Diels-Alder (HDDA) reaction is the subject of this Article. Given the 
efficiency, ease of precursor access, versatility and mild reaction 
conditions revealed by the examples we present here, it is remarkable that this 
reaction has remained essentially unexploited. It is interesting to 
speculate that this may be due in part to the fact that the most common 
depiction of benzyne, the resonance contributor 1 (Fig. 1e), obscures 
its potential construction via a [4+2] cycloaddition reaction. It is the 
alternative, but rarely encountered, Kekulé depiction (1′) that reveals 
the opportunity for assembly via an HDDA process. 
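
The oxidation-level ladder running from the classic Diels-Alder reaction to the HDDA reaction can be checked by simple hydrogen bookkeeping. The Python sketch below is illustrative only: the function name, the dictionaries and the choice of parent hydrocarbons (butadiene, vinylacetylene and butadiyne as 4π components; ethylene and ethyne as 2π components) are assumptions made for this example, not part of the reported work. It counts how many hydrogens the C6 product lacks relative to cyclohexene (C6H10) and names the variant accordingly.

```python
# Hydrogen counts of the parent hydrocarbons (illustrative assumption:
# unsubstituted C4 and C2 components, as in the prototypes of Fig. 1).
FOUR_PI = {
    "1,3-butadiene": 6,               # C4H6
    "1,3-enyne (vinylacetylene)": 4,  # C4H4
    "1,3-diyne (butadiyne)": 2,       # C4H2
}
TWO_PI = {
    "ethylene": 4,  # C2H4
    "ethyne": 2,    # C2H2
}

# Number of hydrogens missing relative to cyclohexene -> name prefix.
PREFIX = {0: "classic", 2: "didehydro", 4: "tetradehydro", 6: "hexadehydro"}


def classify(four_pi, two_pi):
    """Return (variant name, product formula) for a [4+2] reactant pair."""
    h_total = FOUR_PI[four_pi] + TWO_PI[two_pi]
    missing = 10 - h_total  # cyclohexene, C6H10, is the reference product
    if missing == 0:
        name = "classic Diels-Alder"
    else:
        name = f"{PREFIX[missing]}-Diels-Alder"
    return name, f"C6H{h_total}"


if __name__ == "__main__":
    for diene in FOUR_PI:
        print(diene, "+ ethyne ->", classify(diene, "ethyne"))
```

In this bookkeeping the diyne plus ethyne pair gives C6H4, the formula of benzyne, which is why the most highly oxidized variant delivers the aryne directly.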

We report below the broad scope of a strategy that combines the 
versatile and efficient generation of benzynes via the HDDA reaction 
with various trapping reactions to yield structurally complex ben- 
zenoid products. Each substrate is a readily accessible conjugated 
diyne containing a remote alkynyl diynophile. These cycloisomerize 
in (a highly exergonic) [4+2] fashion to produce the reactive aryne 
intermediate. The examples demonstrate that the tandem benzyne 
forming/trapping sequence can be designed to proceed with excellent 
efficiency. 


The HDDA reaction revealed 


In the course of an otherwise unrelated study, we attempted to prepare 
the ketotetrayne 14 by oxidation of the precursor alcohol 13 with 
manganese dioxide (Fig. 2a). To our surprise, the major product from 
this experiment, formed in about 5 hours, was the (hexasubstituted) 
benzene derivative 15 (53% yield after purification). We quickly pos- 
tulated that the benzyne intermediate 16’/16 was being both readily 
formed and efficiently trapped by the nucleophilic oxygen atom in the 
fortuitously poised β-siloxyethyl group. Migration of the silyl group 
from O to C within zwitterion 17, a retro-Brook rearrangement, 
accounts for formation of 15. This constitutes an unprecedented 
mode of benzyne trapping. Additionally, the process is attended by 
a substantial increase in structural complexity. The potential power of 
this transformation was immediately apparent. 

We surmised that the modest yield observed in this reaction of 
tetrayne 14 reflected the fact that two competitive modes of [4+2] 
cyclization are possible. We were also keen to learn the feasibility of 
cyclization of analogous triynes. We therefore designed and synthesized 
a substrate—the ketotriyne 21 (Fig. 2b)—that could only undergo a 
HDDA reaction with a single regiochemical outcome. Our efforts were 
rewarded by its smooth transformation at room temperature to the 
hexasubstituted, tetracyclic indenone derivative 22 in 93% yield after 
chromatographic purification. 


Intramolecular trapping 


As the examples presented in Table 1 clearly show, the HDDA- 
initiated cascade has considerable scope with respect to both the 
cycloisomerization and the intramolecular trapping events. Each sub- 
strate is readily accessible by a convergent coupling strategy (compare 
Fig. 2b). All yields of purified products were ≥75%, and (with the 
exception of entry 8) all reactions occurred between temperatures of 
20 and 120 °C. Highlights include: the presence of an electron with- 
drawing substituent on the diynophile enhances substrate reactivity 
(compare conditions for 21 to 22 (Fig. 2b) with entry 1); the activating 
carbonyl group can be a distal (carboalkoxy) rather than a tethering 
substituent (entries 2 and 3); many classical methods of aryne gen- 
eration are not compatible with electron-withdrawing groups in the 
substrate; carbonyl activation is not a necessity (entries 1, 4 and 7); 
products having nitrogen-containing heterocycles annulated to the 
new arene ring can be prepared (entries 3-5); an ester tether (entry 6) 
cyclizes more slowly than its N-phenyl amide analogue (entry 5), 
consistent with the lower concentration of the s-cis conformation 
required for ring closure; our observations are consistent with the 
absence of radical character in both the cycloaddition and trapping 
phases of the process (for example, reactions performed in chloroform 
solvent, an excellent hydrogen atom donor, have shown no evidence of 
hydrogen atom transfer (entries 4 and 9)); the new silyl ether trapping 
reaction has considerable generality (entries 1-3, 5, 6 and 8 and Fig. 2); 
other efficient internal benzyne traps include tethered alcohols (entries 
4 and 7), aryl rings ([4+2] cycloaddition in entry 9), or alkenes (ene 
reaction in entry 10); seven-membered ring formation is feasible (entry 
8), and the robust nature of the substrate and product at the high 
temperature required for this slower cyclization are noteworthy; and 
finally the silyl substituents in many of the products provide handles 
for subsequent elaboration through protonative, oxidative or 
halogenative desilylation or cross-coupling reactions. 

Figure 2 | Mechanistic rationale, substrate synthesis and mild conditions 
for our initial two HDDA reactions. a, Serendipitous observation of the 
HDDA reaction: cyclization of ketone 14 (see blue dashed lines) to putative 
benzyne (red) intermediate 16/16′ (via formation of the bonds indicated by the 
black dashed line in each) and subsequent trapping by the pendant silyl ether 
gave a hexasubstituted benzenoid (blue), the indenone 15. b, Synthesis of 
21 (via convergent coupling of 18 with 19, addition of an ethynyl unit to 20, and 
oxidation) and its facile, high-yielding conversion to the tetracyclic benzenoid 
(blue) 22. The adjacent atoms denoted by the triangle and dot in substrate 
21 map onto those of the product 22. The half-life (t1/2) for conversion of 21 to 
22 was measured by in situ NMR analysis (Supplementary Fig. 1). TBS, 
t-butyldimethylsilyl. 

Table 1 | Examples of intramolecular trapping of HDDA-generated benzynes 

Entry Substrate Conditions Product 

Benzenoid (bold black bonds) synthesis via the HDDA cycloaddition has considerable substrate scope 
with respect to the nature of the poly-yne tether (red) and of the intramolecular trapping moiety (blue). 
Ac, acetyl; Ts, p-toluenesulphonyl; o-DCB, 1,2-dichlorobenzene. 


Intermolecular trapping 


We wished to validate the feasibility of bimolecular trapping of these 
thermally generated benzynes. Clearly, this would add considerable ver- 
satility and power to a HDDA-initiated transformation. The triyne 23 
(Fig. 3) represents a substrate that bears an innocent (non-participating) 
side chain. We have successfully captured the derived benzyne 24 
(formed from 23 in a cyclization reaction having a half-life of ~4h 
at about 85 °C) by a variety of external reagents to give adducts 25. 
Highlights include: benzene as solvent forms the Diels—Alder adduct 
25a (compare entry 9 of Table 1)—although this process has 
been previously documented, because of the low reactivity of simple 
aromatics, rarely have they been trapped by benzynes in high yield; 
this result also indicates that many intramolecular trapping events are 
faster than capture by the aromatic solvents used in earlier examples 
(entries 1-3 and 5-8 of Table 1); the [2+2] cycloaddition of 
norbornene gives 25b in higher yield than has been observed for trapping 
of arynes formed by conventional methods; N-phenylacetamide gives 
25c, demonstrating that a nitrogenous substituent can be conveniently 
installed; acetic acid or phenol traps 24 to cleanly provide adducts 
25d or 25e, respectively, in processes that may share a mechanistic 
feature of transfer of a hydroxyl proton coincident with nucleophilic 
attack; this mode of reaction with acetic acid or phenol is unique and 
complementary to that seen with benzynes generated by 
non-reagent-free methods; and finally net trapping by hydrogen bromide was 
achieved using Br(CH2)2NH2·HBr in THF/H2O (20:1) as an HBr 
source to give 25f (6:1 mixture of isomers). 
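
To put the quoted half-life of about 4 h at about 85 °C into practical terms, the following Python sketch estimates how much triyne remains after a given heating time. It assumes simple first-order kinetics for the unimolecular cycloisomerization, which the text does not state explicitly; the function names and the 4 h default are illustrative.

```python
import math

T_HALF_H = 4.0  # approximate half-life of 23 at about 85 °C, taken from the text


def fraction_remaining(t_hours, t_half=T_HALF_H):
    """Fraction of starting triyne left after t_hours (first-order assumption)."""
    return 0.5 ** (t_hours / t_half)


def time_to_conversion(conversion, t_half=T_HALF_H):
    """Hours needed to reach the given fractional conversion (0 < conversion < 1)."""
    return t_half * math.log2(1.0 / (1.0 - conversion))


if __name__ == "__main__":
    for hours in (4, 8, 16, 24):
        print(f"{hours:>2} h: {100 * fraction_remaining(hours):.1f}% triyne remaining")
    print(f"Time to 99% conversion: {time_to_conversion(0.99):.1f} h")
```

Under this assumption, roughly six to seven half-lives are needed for essentially complete consumption of the substrate.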

The sense of regioselectivity observed for formation of products 
25c-f is consistent with the analyses described in refs 24 and 25, in 
which the relative magnitude of the computed internal bond angles of 
an unsymmetrical benzyne is correlated with the site of nucleophilic 
attack. Namely, the more obtuse angle corresponds to the more 
electron-deficient (δ+) of the two benzyne carbon atoms. We computed 
the geometry for 24 (R = CH3) using density functional theory (DFT) 
at the M06-2X/6-31+G(d,p) level to have internal angles of 135° 
and 119° at atoms ‘a’ and ‘b’, respectively. We are currently inves- 
tigating additional substrates that should allow us to distinguish the 
relative impact of electronic versus ring-distortion effects on the site 
of attack by external nucleophiles. 
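
The angle-based rule of thumb described above can be written as a one-line selection. The Python sketch below is only an illustration of that rule: the function name and the dictionary layout are hypothetical, and the two angles are the computed values quoted in the text for benzyne 24.

```python
def predicted_attack_site(internal_angles_deg):
    """Return the label of the aryne carbon with the larger internal angle.

    Per the distortion analysis cited in the text, the carbon at the more
    obtuse angle is the more electron-deficient one and is the expected
    site of nucleophilic attack.
    """
    if len(internal_angles_deg) != 2:
        raise ValueError("expected exactly two benzyne carbon atoms")
    return max(internal_angles_deg, key=internal_angles_deg.get)


if __name__ == "__main__":
    angles_24 = {"a": 135.0, "b": 119.0}  # computed internal angles for 24
    print("Predicted site of nucleophilic attack:", predicted_attack_site(angles_24))
```

This captures only the ring-distortion argument; it does not separate electronic from steric contributions, which the text notes is still under investigation.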

To gain understanding of some of the key thermodynamic features 
associated with the HDDA aryne-forming step, we turned to compu- 
tational analysis of the reaction of ester 26 (Fig. 4). This simple triyne 
was cycloisomerized and the resulting benzyne trapped in t-butanol 
(120 °C, closed tube) to produce 5-t-butoxyphthalide (28) in 68% 
yield. Johnson and co-workers recently reported a DFT approach to 
compute the energetics of the hypothetical HDDA reaction of 7 with 
12 to produce o-benzyne (1, Fig. 1e). They concluded that this 
benzyne-forming reaction was exothermic by 51.4 kcal mol−1. Using 
similar DFT methodology, we have computed the free energy of 
reaction for the conversion of triyne 26 to the aryne 27 and found it to be 
−51 kcal mol−1. It is remarkable that highly reactive benzyne 
intermediates can be accessed by a thermal process that is exothermic by 
~50 kcal mol−1. These very favourable reaction energetics reflect the 
large amount of potential energy inherent in the (albeit kinetically 
protected) alkyne functional group. Finally, this point is further 
underscored by the computed free energy of reaction (namely, −73 kcal 
mol−1) for the trapping by t-butanol of the strained alkyne in 27. 
Thus, the overall transformation of 26 to 28 is computed to be 
exothermic by >120 kcal mol−1. 
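
The arithmetic behind the overall figure can be made explicit. The Python sketch below adds the two computed free energies quoted in the text and, as an illustration of how strongly favoured the aryne-forming step is, converts its free energy into an equilibrium constant at the 120 °C reaction temperature. The conversion uses the standard relation K = exp(−ΔG/RT); applying it to these computed values is an assumption made for this example, not a calculation reported by the authors.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal mol^-1 K^-1

DG_CYCLOISOMERIZATION = -51.0  # 26 -> 27, kcal/mol (computed value from the text)
DG_TRAPPING = -73.0            # 27 -> 28, kcal/mol (computed value from the text)


def overall_free_energy(*steps):
    """Sum the stepwise free energies of a reaction cascade."""
    return sum(steps)


def log10_equilibrium_constant(dg_kcal, temp_k):
    """log10 of K = exp(-dG / RT) for a free energy in kcal/mol."""
    return -dg_kcal / (R_KCAL * temp_k) / math.log(10)


if __name__ == "__main__":
    total = overall_free_energy(DG_CYCLOISOMERIZATION, DG_TRAPPING)
    print(f"Overall free energy, 26 -> 28: {total:.0f} kcal/mol")
    log_k = log10_equilibrium_constant(DG_CYCLOISOMERIZATION, 393.15)
    print(f"log10 K for 26 -> 27 at 120 °C: {log_k:.1f}")
```

The sum, −124 kcal mol−1, is consistent with the statement that the overall transformation is favourable by more than 120 kcal mol−1.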


Figure 3 | Examples of intermolecular trapping of HDDA-generated 
benzynes. Bimolecular trapping reactions of benzyne 24 to give adducts 25a-f. 
Under the structure of each of these adducts is given information on the 
trapping agent (and amount) and yield following purification. Stereochemical 
information is given for adduct 25b. Regiochemical information is given for 
adducts 25c-f. THF, tetrahydrofuran. Panel labels (partial): phenol (0.1 M), 85%; 
BrCH2CH2NH3+Br− (0.1 M), 72%, solvent THF:H2O (20:1). 

Figure 4 | Computed free energy changes for a representative 
HDDA-initiated cascade. Free energies of reaction (ΔG_M06-2X = 
G_M06-2X(product) − G_M06-2X(reactant)) were computed by DFT at the 
M06-2X/6-31+G(d,p) level for the aryne-generating (26 to 27) and t-butanol 
(tBuOH) trapping (27 to 28) reactions. These serve to highlight the favourable 
thermodynamics associated with both the triyne cycloisomerization and aryne 
trapping events. 


Discussion 

Our results show that the HDDA reaction is a general method for 
generating benzynes under conditions amenable to a wide variety of 
intra- and intermolecular trapping events. This powerful and efficient 
domino sequence comprises a fundamentally new way to synthesize 
benzenoid compounds, especially those having high structural 
complexity. Because HDDA-derived benzynes are produced in the 
absence of by-products and external reagents, the intrinsic reactivity 
of a benzyne can be more fully explored and exploited. Even at this 
early stage of development, many additional lines of investigation 
have revealed themselves. These include studies of mechanistic issues 
(concerted versus stepwise cycloaddition), substituent effects (electronic 
versus steric), catalysis (Lewis acid or transition metal), new modes 
of benzyne trapping, the use of cleavable tethers, and the feasibility 
of bimolecular HDDA cycloadditions. This enabling technology is 
well-suited for preparing compound libraries, for accessing aromatics 
having substitution patterns that would be otherwise challenging to 
prepare, and for application to target molecule synthesis (for example, 
drugs, natural products, heteroaromatics and polyacenes). Lessons 
of how imagery (that is, 1 (versus 1′) from 7 + 12) can bias our 
perception are also embedded in the developments described here. 
Finally, our results serve as a reminder that, even in the twenty-first 
century, serendipitous discovery (“in the course of an otherwise unre- 
lated study”) in science can still play a pivotal role. 


METHODS SUMMARY 


General procedure. An oven-dried vial containing the triyne precursor in the 
indicated solvent (0.05 M) was closed with a Teflon-lined cap and heated at the 
indicated (external bath) temperature. After the indicated time the product was 
purified by chromatography on silica gel. Full procedural details and characteri- 
zation data for all new compounds (substrates and products) and a detailed 
description of the computational protocols and results are provided in Supplemen- 
tary Information. 
Small heat-shock proteins protect from 
heat-stroke-associated neurodegeneration 


Nikos Kourtis, Vassiliki Nikoletopoulou & Nektarios Tavernarakis 


Heat stroke is a life-threatening condition, characterized by catastrophic collapse of thermoregulation and extreme 
hyperthermia. In recent years, intensification of heat waves has caused a surge of heat-stroke fatalities. The mechanisms 
underlying heat-related pathology are poorly understood. Here we show that heat stroke triggers pervasive necrotic cell 
death and neurodegeneration in Caenorhabditis elegans. Preconditioning of animals at a mildly elevated temperature 
strongly protects from heat-induced necrosis. The heat-shock transcription factor HSF-1 and the small heat-shock 
protein HSP-16.1 mediate cytoprotection by preconditioning. HSP-16.1 localizes to the Golgi, where it functions with the 
Ca2+- and Mn2+-transporting ATPase PMR-1 to maintain Ca2+ homeostasis under heat stroke. Preconditioning also 
suppresses cell death inflicted by diverse insults, and protects mammalian neurons from heat cytotoxicity. These 
findings reveal an evolutionarily conserved mechanism that defends against diverse necrotic stimuli, and may be 
relevant to heat stroke and other pathological conditions involving necrosis in humans. 


Heat-related pathologies such as heat stroke are estimated to soon 
become one of the most serious causes of mortality. Climatic shifts and 
other contemporary anthropogenic causes contribute to the rising 
prevalence of hyperthermia incidents and associated deaths. During heat 
stroke, core body temperature in excess of 40 °C elicits acute tissue 
injury and multi-organ failure that is often fatal. The nervous system 
is particularly vulnerable. Heat-stroke survivors commonly suffer 
permanent neurological damage. The devastating impact of 
hyperthermia on human health is a consequence of immediate effects of 
heat on cellular and organismal physiology, coupled with secondary 
inflammatory and coagulation responses. Despite the severity and 
increasingly prevalent health risks associated with heat-inflicted 
damage, the cellular and molecular mechanisms responsible for the 
direct cytotoxicity of heat are not well understood. 


Heat stroke kills by inducing necrosis 


To gain insight into the molecular basis of heat cytotoxicity and to 
circumvent the confounding influence of secondary physiological and 
inflammatory responses, we developed and characterized a genetically 
tractable model of heat stroke in C. elegans, a poikilothermic 
nematode worm that normally grows at ambient temperature. C. 
elegans is particularly suitable for investigating the direct cytotoxicity 
of heat because it lacks elaborate thermal regulation systems that 
would offset elevated external temperature by mounting homeother- 
mic responses. Worms instantly equilibrate to ambient temperature. 
To simulate hyperthermia, we exposed synchronous nematode popu- 
lations to 39°C for 15 min. We then assessed animal survival 16 to 
18h after delivery of the noxious heat pulse (Supplementary Fig. 1a). 
Excessive heat caused immediate changes in animal behaviour and 
marked increase of mortality. Ultimately, less than 25% of animals 
survived heat stroke (Fig. 1a). 
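
Survival in an assay of this kind is typically summarized as a percentage per replicate and then as mean ± s.e.m. across replicates. The Python sketch below shows that calculation; the replicate counts are hypothetical placeholders invented for illustration and are not data from this study, and the function names are likewise assumptions.

```python
from math import sqrt
from statistics import mean, stdev


def percent_survival(alive, total):
    """Percentage of animals scored alive in one replicate."""
    return 100.0 * alive / total


def mean_and_sem(values):
    """Mean and standard error of the mean across replicates."""
    return mean(values), stdev(values) / sqrt(len(values))


# Hypothetical (alive, total) counts for three replicate plates,
# scored 16 to 18 h after the heat pulse. NOT data from the paper.
replicates = [(20, 100), (22, 100), (24, 100)]

survival = [percent_survival(alive, total) for alive, total in replicates]
avg, sem = mean_and_sem(survival)

if __name__ == "__main__":
    print(f"Survival: {avg:.1f} ± {sem:.1f}% (n = {len(replicates)} replicates)")
```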

We observed widespread cell death in several tissues of afflicted 
individuals (Fig. 1b, c and Supplementary Fig. 2). Dying cells showed 
necrotic morphological features (Supplementary Fig. 1c and 
Supplementary Fig. 3a-e), expressed markers of necrotic death (Supplementary 
Fig. 1d), and became permeable to propidium iodide (Supplementary 
Fig. 1e). Loss of key apoptosis or autophagy mediators did not 
suppress heat-stroke-induced mortality (Supplementary Fig. 4a, b). 
Moreover, autophagy was not upregulated in animals exposed to heat 
stroke (Supplementary Fig. 4c, d). Therefore, cell death consequent to 
heat stroke does not depend on the core apoptotic or autophagic 
machinery. In contrast, depletion of proteins required for necrosis 
strongly promotes survival after heat stroke (Supplementary Fig. 4e). 
Thus, we conclude that heat stroke compromises viability by triggering 
extensive necrotic cell death in the nematode. 


Preconditioning defends against hyperthermia 


All organisms possess homeostatic mechanisms that promote survival 
by mounting adaptive responses to stressors. Acute insults such as 
extreme heat may cause necrosis by overwhelming cellular homeostasis. 
Interestingly, pre-exposure to mild stress often elicits increased 
resistance to subsequent severe stress, a phenomenon termed hormesis. We 
investigated whether experiencing intermediate, non-lethal 
temperature before heat stroke (preconditioning) evokes a similar hormetic 
effect in C. elegans. We found that preconditioning at 34 °C for 30 min 
(Supplementary Fig. 1a) markedly enhanced the capacity of animals to 
withstand heat stroke (Fig. 1a and Supplementary Figs 1c and 2). 
Survivors maintained viability over several days after heat stroke and 
showed no behavioural defects compared to untreated controls 
(Supplementary Fig. 5 and Supplementary Table 1). 

Preconditioning may exert its protective effect by engaging the 
heat-shock-response pathway to fortify cells under extreme temper- 
ature. To test this idea, we examined mutants lacking HSF-1, the 
transcription factor that coordinates the heat-shock response. 
Preconditioning does not increase survival of these animals, indi- 
cating that HSF-1 is required for protection against heat stroke. 
Moreover, overexpression of HSF-1 suppresses necrotic cell death 
and augments survival further, both after and without precondition- 
ing (Fig. 1a and Supplementary Fig. 2). Therefore, HSF-1 is necessary 
and sufficient to render animals resistant to heat stroke. 

Low insulin-like signalling (ILS) increases intrinsic 
thermotolerance of C. elegans during prolonged incubation at intermediate 
temperature (35 °C). We find that ILS deficiency enhances survival after 
heat stroke, both with and without preconditioning (Supplementary 
Fig. 6a). HSF-1, which mediates intrinsic thermotolerance, is also 
required for the resistance of ILS mutants to heat stroke 
(Supplementary Fig. 6a). In addition to HSF-1, thermotolerance 
conferred by low ILS is mediated by the FOXO (forkhead box protein O) 
transcription factor DAF-16, which translocates to the nucleus under 
stress. We found that DAF-16 accumulates in the nucleus after 
preconditioning (Supplementary Fig. 6b, c). Such relocalization of 
DAF-16 is consistent with a synergistic effect of HSF-1 and DAF-16 
on heat-shock protein (HSP) gene promoters that facilitates their 
maximal expression. Indeed, DAF-16 is partly required for 
protection against heat stroke by preconditioning or low ILS 
(Supplementary Fig. 6a). 

Figure 1 | Heat preconditioning protects against extreme thermal stress 
through HSF-1 and HSP-16.1. a, Survival of wild-type animals, hsf-1(sy441) 
or hsp-16.1(ok577) mutants, animals overexpressing hsf-1 (Ex[phsf-1HSF-1]) or 
hsp-16.1 (Ex[plet-858HSP-16.1]), and hsf-1 and hsp-16.1 mutants overexpressing 
hsp-16.1 after heat stroke, either without (untreated) or after preconditioning 
(n = 350 animals per assay; P < 0.001 for wild-type untreated versus 
preconditioned, P > 0.05 for hsf-1(sy441) or hsp-16.1(ok577) untreated versus 
preconditioned, P < 0.001 for wild-type untreated versus Ex[phsf-1HSF-1] or 
Ex[plet-858HSP-16.1] untreated, P < 0.001 for hsp-16.1(ok577) untreated versus 
hsp-16.1(ok577);Ex[plet-858HSP-16.1] untreated, P < 0.001 for hsf-1(sy441) 
untreated versus hsf-1(sy441);Ex[plet-858HSP-16.1] untreated; two-way analysis 
of variance (ANOVA)). b, Survival of anterior CEP (cephalic) dopaminergic 
neurons of wild-type animals, and animals overexpressing hsp-16.1 under heat 
stroke conditions (n = 350 animals per assay; P < 0.001 for wild-type 
heat-stroked versus Ex[plet-858HSP-16.1] heat-stroked animals; unpaired t-test; ***P 
< 0.001). Error bars, mean ± s.e.m. c, Images of the head region of wild-type 
animals (top panel) and animals overexpressing hsp-16.1 (bottom panel) after 
heat stroke. Remnants of neuron cell bodies (arrows) and axonal beading 
(arrowheads) are seen in wild-type animals (top panel). Both soma (arrows) 
and axonal (arrowheads) architecture is preserved in animals overexpressing 
hsp-16.1. Scale bar, 20 μm. 

We also tested the requirement for other key stress response 
regulators: SKN-1, which mediates oxidative stress resistance, and the 
hypoxia inducible factor HIF-1. After activation, SKN-1 translocates 
to the nucleus. We did not detect nuclear accumulation of SKN-1 
after preconditioning (Supplementary Fig. 6d). Moreover, SKN-1 is 
dispensable for cytoprotection by preconditioning or low ILS 
(Supplementary Fig. 6a, e). HIF-1 is also not needed for the protective 
effect of preconditioning (Supplementary Fig. 6f). Based on these 
findings, we conclude that HSF-1 specifically promotes survival under 
extreme thermal stress, following preconditioning at intermediate 
temperature. 

After activation, HSF-1 induces expression of genes encoding 
HSPs, which protect cells from various cytotoxic conditions. 
Based on their molecular mass, HSPs are classified into six main 
families. A plausible mechanism pertinent to the strong shielding 
effect of preconditioning involves the coordinated upregulation of 
batteries of heat-shock genes by HSF-1. To test this notion, we 
surveyed key representatives of all main C. elegans heat-shock-protein 
families for their role in heat-stress resistance acquired after 
preconditioning. Instead of a distributed contribution from several 
HSPs, we found that HSP-16.1, HSP-16.41 and DNJ-19 are primarily 
needed for acquired tolerance to heat stroke (Fig. 1a and 
Supplementary Fig. 7a). However, among the corresponding genes, only hsp-16.1 
and hsp-16.41 show significant upregulation after preconditioning 
(P < 0.001 untreated versus preconditioned, unpaired t-test), whereas 
dnj-19 is constitutively expressed, even under control conditions 
(Supplementary Fig. 7b). DNJ-19 is a co-chaperone that modulates 
the activity of HSP-70. We found that hsp-70 expression is similarly 
elevated, and not subject to regulation by preconditioning (Supplemen- 
tary Fig. 7b). Of the two hsp genes that are specifically induced by 
preconditioning, only overexpression of hsp-16.1 conferred significant 
protection, bypassing the requirement for preconditioning to suppress 
necrosis and increase animal survival after heat stroke. Moreover, it 
further potentiated thermotolerance after preconditioning (Fig. 1 and 
Supplementary Figs 2 and 7c). 

HSP-16.1 belongs to the family of α-crystallin domain-containing 
small heat shock proteins (sHSPs) that are strongly induced under heat 
stress or low insulin signalling, and contribute to stress resistance and 
longevity in C. elegans. Transcriptional upregulation of hsp-16.1 
after preconditioning is fully dependent on HSF-1 (Supplementary 
Fig. 8a). In contrast, induction of hsp-16.1 expression by low ILS is 
not abolished by loss of DAF-16 (Supplementary Fig. 8b). 

Overexpression of hsp-16.1 under the control of a heterologous, 
non-heat-shock-responsive promoter bypasses the requirement for 
HSF-1 to defend against heat stroke (Fig. 1a). Our findings show that 
HSP-16.1 is specifically induced during preconditioning, and is both 
necessary and sufficient for resistance to hyperthermia. Other HSPs 
that seem also to be needed for extreme heat tolerance are either not 
specifically induced (for example, HSP-70 and DNJ-19) or are not 
sufficient for protection (HSP-16.41). 


PMR-1 is required for hormetic protection 


Typically, sHSPs are ubiquitously expressed and show widespread 
subcellular distribution. Intriguingly, HSP-16.1 shows distinct 
localization to the medial Golgi of several cell types and is absent from 
other subcellular sites, including the closely related trans-Golgi and 
endosomes (Fig. 2a and Supplementary Figs 9a-d and 10a). HSP- 
16.41 displays a similar subcellular localization pattern (Supplemen- 
tary Fig. 10b). This highly compartmentalized distribution is main- 
tained under stress conditions in different genetic backgrounds 
(Supplementary Fig. 10c, d), suggesting that the two sHSPs function 
specifically in the medial Golgi to battle heat stress. 

The Golgi apparatus is the main organelle for protein sorting and 
post-translational modification of proteins and lipids. In addition, 
together with the endoplasmic reticulum and mitochondria, the Golgi 
is a major cellular Ca2+ reservoir, facilitating cellular ion 
homeostasis. Perturbation of intracellular calcium concentration ([Ca2+]i) 
homeostasis has been implicated in necrotic cell death, both in 
mammals and C. elegans. We considered whether heat stroke 
might inflict necrosis by diminishing the capacity of cells to maintain 
low [Ca2+]i levels under noxious thermal stress. To test this 
hypothesis, we knocked down the itr-1 gene encoding the 
inositol-1,4,5-trisphosphate receptor (InsP3R) channel, which mediates Ca2+ 
trafficking out of both the Golgi and the endoplasmic reticulum. 
InsP3R deficiency increased both intrinsic and acquired resistance 
to heat stroke (Fig. 2b). We found a similar effect after treatment of 
animals with dantrolene, a compound that inhibits Ca2+ release from 
the Golgi and the endoplasmic reticulum through the ryanodine 
receptor (RyR), or EGTA, a Ca2+ chelator (Fig. 2b and 
Supplementary Fig. 11). Hence, moderating [Ca2+]i levels by blocking release of 
Golgi and endoplasmic reticulum Ca2+ stores confers protection 
against thermal damage. 

Figure 2 | HSP-16.1 localizes to the medial Golgi and functions together 
with PMR-1 to mediate the protective effect of preconditioning against heat 
stroke. a, Images (cell bodies) of C. elegans neurons expressing fluorescently 
tagged HSP-16.1 along with medial (top row, middle panel) or trans-Golgi 
(bottom row, middle panel) markers. Scale bar, 6 μm. b, Survival of animals 
deficient for ITR-1 or SCA-1 (itr-1(RNAi), sca-1(RNAi)), or animals exposed to 
dantrolene after heat stroke, either without (untreated) or after preconditioning 
(n = 300 animals per group; P < 0.001 for wild-type untreated (control) versus 
itr-1(RNAi), sca-1(RNAi) or dantrolene-exposed; two-way ANOVA). 
c, Survival of wild-type animals, nucb-1(pk1654) mutants or animals 
overexpressing nucb-1 (Ex[plet-858NUCB-1]) after heat stroke, either without 
(untreated) or after preconditioning (n = 300 animals per assay; P < 0.001 for 
wild-type untreated versus Ex[plet-858NUCB-1] untreated, P < 0.01 for 
wild-type untreated versus nucb-1(pk1654) untreated; two-way ANOVA). 
d, Survival of animals depleted for PMR-1 (pmr-1(tm1840) and pmr-1(tm1750) 
mutants, or pmr-1(RNAi) animals) under heat stroke, either without 
(untreated) or after preconditioning (n = 800 animals per assay; P < 0.001 for 
wild-type preconditioned versus pmr-1(tm1840), pmr-1(tm1750) or 
pmr-1(RNAi) preconditioned; two-way ANOVA). e, Survival of wild-type 
animals and hsf-1(sy441) and hsp-16.1(ok577) mutants overexpressing 
pmr-1 (Ex[plet-858PMR-1]) under heat stroke, either without (untreated) or 
after heat preconditioning (n = 300 animals per assay; P < 0.001 for wild-type 
untreated versus Ex[plet-858PMR-1], hsf-1(sy441);Ex[plet-858PMR-1] or 
hsp-16.1(ok577);Ex[plet-858PMR-1] untreated; two-way ANOVA). Error bars, 
mean ± s.e.m. 

Ostensibly at odds with this idea, we observed a similar protective 
effect by knock down of sca-1, a gene encoding the sarco-endoplasmic 
reticulum Ca2+ ATPase (SERCA), which pumps Ca2+ from the 
cytoplasm into both the Golgi and the endoplasmic reticulum 
(Fig. 2b). Inhibition of SERCA induces endoplasmic reticulum stress 
and the unfolded protein response (UPR) pathway, leading to 
adaptation and survival. We speculated that activation of the UPR-ER 
(UPR in the endoplasmic reticulum) may account for the paradoxical 
cytoprotection. To test this hypothesis, we monitored the UPR-ER after 
downregulation of SERCA, by means of a UPR-ER indicator. SCA-1 
depletion resulted in strong induction of UPR-ER (Supplementary 
Fig. 12a). We then tested whether elevated UPR-ER is sufficient to 
confer protection against heat stroke. Indeed, xbp-1 mutants, which 
display marked constitutive increase of UPR-ER under physiological 
conditions, are resistant to heat stroke (Supplementary Fig. 12b). 
However, we found that preconditioning does not activate UPR-ER 
(Supplementary Fig. 13a, b), suggesting that induction of UPR-ER is 
not responsible for the protective effect of preconditioning. We also 
examined the involvement of mitochondrial UPR (UPR-mt), using a 
phsp-6GFP (green fluorescent protein) reporter. Similar to the 
UPR-ER, the UPR-mt is not activated by preconditioning 
(Supplementary Fig. 13c, d). Our findings indicate that preconditioning, as 
implemented in our assay, is highly specific in activating solely the 
heat-shock-response pathway, without engaging other stress 
responses typically induced by prolonged incubation at elevated 
temperatures (Supplementary Fig. 14a-d). 

None of the above manipulations that render animals resistant to
hyperthermia is specific to the Golgi, where HSP-16.1 localizes. To
manipulate Ca²⁺ homeostasis specifically at the Golgi, we measured
survival of transgenic animals overexpressing nucleobindin 1
(NUCB-1), a Golgi-resident Ca²⁺-buffering protein, after heat
stroke. We found that sequestration and retention of Ca²⁺ in the
Golgi by nucleobindin is sufficient to protect from heat stroke, even
without preconditioning. In contrast, animals deficient for NUCB-1
are sensitized to heat stroke-induced damage (Fig. 2c).

To further isolate and assess the contribution of the Golgi to Ca²⁺
homeostasis, we examined animals lacking PMR-1, a Golgi-specific
Ca²⁺- and Mn²⁺-transporting P-type ATPase that colocalizes
with HSP-16.1 (Supplementary Fig. 15). PMR-1 deficiency suppresses
preconditioning-acquired resistance to heat stroke (Fig. 2d).
Importantly, overexpression of PMR-1 is sufficient to promote
survival after heat stroke even without preconditioning, bypassing
the requirement for both HSF-1 and HSP-16.1 (Fig. 2e). In addition,
pmr-1 expression is not altered by loss of HSF-1 or HSP-16.1
(Supplementary Fig. 16). Therefore, PMR-1 functions downstream
of HSF-1 and HSP-16.1 to protect from hyperthermia. Together,
our findings indicate that heat acclimatization through preconditioning
may improve cytoprotection and animal survival by augmenting
cellular Ca²⁺ homeostasis through the Golgi.




A universal cytoprotective mechanism 


We considered the capacity of the mechanism involving specific
heat-shock-response components and the Golgi PMR-1 ATPase to defend
against additional necrotic insults unrelated to thermal stress.
Dominant gain-of-function mutations in specific neuronal ion
channels such as the degenerin MEC-4 and the acetylcholine receptor
channel subunit DEG-3 evoke degenerative, necrotic cell death in
C. elegans that is analogous to excitotoxicity in mammals.
Hypoxic conditions and overexpression of aggregation-prone
proteins such as α-synuclein also trigger necrosis in the
nematode. We exposed animals to brief heat preconditioning, before
assaying cell death induced by toxic proteins or survival under hypoxia
(Supplementary Fig. 1b). Remarkably, preconditioning suppressed
neurodegeneration triggered by hyperactive ion channels or by
overexpression of human α-synuclein (Fig. 3a, b and Supplementary
Fig. 17a). Moreover, preconditioning increased animal resistance to
hypoxic conditions (Fig. 3c). Preconditioning does not alter the
expression, stability or function of cytotoxic proteins to ameliorate
cell death (Supplementary Fig. 17b-d). In addition, suppression of
neurodegeneration is not a mere consequence of a developmental
delay in the onset of cell death (Supplementary Fig. 17e). Therefore,
the heat-shock-response pathway intervenes downstream of necrotic
insults to block necrosis. Cytoprotection is mediated by HSF-1 and
HSP-16.1, which are both necessary and sufficient to ameliorate
necrotic cell death (Fig. 3d, e). Thus, induction of the HSF-1-HSP-16
axis generally protects against necrosis inflicted by diverse stressors.

How might the heat-shock response neutralize necrotic stimuli?
Excitotoxic insults initiate neuronal necrosis through elevation of
[Ca²⁺]ᵢ and subsequent aberrant activation of specific Ca²⁺-dependent
calpain proteases. In turn, calpains promote cell destruction by
facilitating lysosome rupture. We found that overexpression of
NUCB-1 suppresses excitotoxic cell death, indicating that Ca²⁺ release
from the Golgi contributes to neurodegeneration (Fig. 3f).

Figure 3 | HSF-1 and HSP-16.1 mediate protection against necrosis
inflicted by diverse insults upon preconditioning. a, Number of neuron
corpses at the L1 larval stage of development, per 100 animals carrying the
neurotoxic dominant (d) mec-4(d) or deg-3(d) alleles, hatched from untreated
or preconditioned eggs (n = 350 animals per assay; P < 0.001 for untreated
versus preconditioned; two-way ANOVA). b, Dopaminergic neuron survival
on the fifth day of adulthood in wild-type animals overexpressing α-synuclein,
without or after receiving daily preconditioning (30 or 60 min per day; n = 350
animals per assay; P < 0.001 for untreated versus preconditioned; unpaired
t-test; ***P < 0.001). c, Survival of wild-type animals after near-lethal
treatment with sodium azide (NaN₃), either without or after preconditioning
(n = 350 animals per assay; P < 0.001 for untreated versus preconditioned;
unpaired t-test). d, Number of neuron corpses, at the L1 larval stage, per 100
hsf-1(sy441);mec-4(d) double mutants, and mec-4(d) animals overexpressing
hsf-1 hatched from untreated or preconditioned eggs (n = 350 animals per
assay; P > 0.05 for hsf-1(sy441);mec-4(d) untreated versus preconditioned,
P < 0.001 for mec-4(d) untreated versus mec-4(d);Ex[Phsf-1HSF-1]; two-way
ANOVA). e, Number of neuron corpses at the L1 larval stage, per 100
hsp-16.1(ok577);mec-4(d) double mutants, and mec-4(d) animals overexpressing
hsp-16.1 hatched from untreated or preconditioned eggs (n = 350 animals per
assay; P > 0.05 for hsp-16.1(ok577);mec-4(d) untreated versus preconditioned,
P < 0.001 for mec-4(d) untreated versus mec-4(d);Ex[Pmec-17HSP-16.1];
two-way ANOVA). f, Number of neuron corpses, at the L1 stage, per 100 mec-4(d)
mutants overexpressing nucb-1 (n = 250 animals; P < 0.001, unpaired t-test).
Error bars, mean ± s.e.m.


To examine whether heat stroke also triggers neurodegeneration by
increasing [Ca²⁺]ᵢ, we monitored cytoplasmic Ca²⁺ levels in C. elegans
neurons after heat stroke. Similar to the cytotoxic dominant MEC-4
ion channel, extreme thermal stress causes a sharp increase in [Ca²⁺]ᵢ
levels (Fig. 4a, b). Heat preconditioning or inhibition of the InsP₃R
channel, the SERCA pump and the RyR channel quenches [Ca²⁺]ᵢ
elevation after subsequent heat stroke (Fig. 4a). Preconditioning by
itself does not increase Ca²⁺ levels (Supplementary Fig. 18). These
findings parallel the protective effect of the above interventions against
heat stroke-induced damage and death (Fig. 2b). In contrast, loss of the
Golgi-specific PMR-1 Ca²⁺ importer results in constitutive increase of
[Ca²⁺]ᵢ levels (Fig. 4a), consistent with the requirement for PMR-1 in
preconditioning-acquired thermotolerance (Fig. 2d). Moreover, loss of
PMR-1 exacerbates degenerin-induced necrotic cell death (Supplementary
Fig. 17f). Thus, Ca²⁺ import into the Golgi through PMR-1 is
critical for survival after both heat stroke and excitotoxic insults.
Combined, our data reveal a previously unsuspected and prominent
role of the medial Golgi in maintaining Ca²⁺ homeostasis under
cytotoxic stress.

Figure 4 | Preconditioning requires PMR-1 to alleviate heat-stroke-induced
cytoplasmic Ca²⁺ overload. a, Fluorescence intensity of neurons expressing
the Ca²⁺ reporter GCaMP2.0, in wild-type animals, pmr-1(tm1840) mutants,
and animals deficient for ITR-1, SCA-1 or treated with dantrolene, under
normal conditions (untreated) and after heat stroke, either without or after
preconditioning (n = 60 animals per assay; P < 0.001 for wild-type
heat-stroked versus wild-type untreated or wild-type preconditioned; P > 0.05 for
similar comparisons in mutant strains or after treatment with dantrolene;
P < 0.001 for comparisons between pmr-1(tm1840) and other mutants or
dantrolene treatment; two-way ANOVA). b, Fluorescence intensity of neurons
expressing GCaMP2.0, in wild-type animals, mec-4(d) mutants and mec-4(d)
animals overexpressing hsp-16.1 (n = 150 animals per assay; P < 0.001 for
mec-4(d) versus wild-type or mec-4(d);Ex[Pmec-17HSP-16.1]; unpaired t-test;
**P < 0.01). Representative images of neuron cell bodies are shown above each
strain, for each condition. Scale bars, 10 μm. Error bars, mean ± s.e.m.


A conserved pathway against hyperthermia 


Our findings using a C. elegans hyperthermia model implicate
specific heat-shock-response components and Golgi-mediated Ca²⁺
homeostasis in resistance to a broad spectrum of necrotic insults,
acquired after heat preconditioning. Has this protective mechanism
been maintained during metazoan evolution or is it peculiar to the
nematode? To address this question, we investigated whether key
aspects of heat stroke neuropathology and heat preconditioning are
conserved in mammalian neurons. To this end, we differentiated
mouse embryonic stem cells into a homogeneous population of
glutamatergic excitatory neurons. Heat stroke induced pervasive
death of embryonic-stem-cell-derived neurons (Fig. 5a, b and
Supplementary Fig. 19a), and this effect could be largely prevented by heat
preconditioning (Fig. 5a, b). We obtained similar results using
primary cultures of mouse cortical (Fig. 5a, b) and striatal neurons
(data not shown). In all cases, heat-stroked neurons showed axonal
degeneration and other characteristics typical of necrotic death
(Supplementary Fig. 19a, b).

Notably, similar to the response in C. elegans, we found that
overexpression of crystallin αA (CRYAA), which co-localizes with the
Golgi marker α-mannosidase II and the PMR1 ATPase (Supplementary
Fig. 20a, b), was sufficient to protect mammalian neurons from
heat-stroke-induced death, even in the absence of preconditioning
(Fig. 5c). Finally, we tested whether the protective effect of
preconditioning in mammalian neurons is also mediated by the PMR1
ATPase. Heat stroke caused massive necrotic death and axonal
degeneration in neurons expressing short hairpin RNAs (shRNAs)
against Pmr1, even after preconditioning (Fig. 5d; Supplementary
Fig. 21). Thus, knockdown of Pmr1 abolishes the protective effect of
preconditioning in mammalian neurons. These findings suggest
that hormetic stress exposure engages a potent, evolutionarily
conserved mechanism to defend against necrotic neurodegeneration, in
metazoans as diverse as nematodes and mammals.

Figure 5 | Heat preconditioning protects mammalian neurons against
extreme thermal stress through crystallin αA and PMR1. a, Images of
embryonic stem cell (ESC)-derived neurons (top panels) and primary cortical
neurons (bottom panels) immunostained for β-III tubulin, under normal
conditions (untreated) and after heat stroke, either without or after
preconditioning. Neuronal nuclei are stained with Hoechst. Scale bar, 100 μm.
b, Corpses of ESC-derived and primary cortical neurons, under normal
conditions (untreated) and after heat stroke, either without or after
preconditioning (n = 900 neurons per assay; P < 0.001 for ESC-derived or
cortical heat-stroked neurons versus corresponding preconditioned neurons;
two-way ANOVA; error bars, mean ± s.e.m.). c, Images of ESC-derived
neurons overexpressing Cryaa, or control vector after lentiviral infection, under
normal conditions (untreated) and after heat stroke. Scale bar, 100 μm.
d, Images of ESC-derived neurons after Pmr1 downregulation, under normal
conditions (untreated) and after heat stroke, either without or after
preconditioning. Scale bar, 100 μm.




Discussion 


In this report, we describe a versatile, genetically tractable C. elegans
model of heat stroke and show that a single sHSP, HSP-16.1, is an
effective general protector against multiple necrotic insults. HSP-16.1
localizes in the Golgi, where it functions together with the PMR-1
pump to prevent cytoplasmic Ca²⁺ overload under extreme stress.
Hormetic induction of HSP-16.1 expression by brief heat preconditioning
is sufficient to fortify cells against diverse insults. Our findings,
in both C. elegans and mammalian neurons, indicate that an
evolutionarily conserved pathway underlies the protective response
(Fig. 6c). Three specific predictions derive from this model. First, that
PMR-1 is required for resistance to extreme temperature, conferred
by HSP-16.1 overexpression. Second, that HSP-16.1, NUCB-1 and
PMR-1, which are exclusively localized in the Golgi, may promote
survival by augmenting Ca²⁺ sequestration into the Golgi and
consequently maintaining subthreshold [Ca²⁺]ᵢ levels. Third, and
importantly, the model also predicts that moderation of [Ca²⁺]ᵢ levels by
HSP-16.1 requires PMR-1. We tested each of these predictions and
found that, in complete agreement with the model, PMR-1 depletion
diminishes the protective effect of HSP-16.1 overexpression (Fig. 6a).
In addition, HSP-16.1, NUCB-1 or PMR-1 overexpression moderates
[Ca²⁺]ᵢ elevation induced by heat stroke (Fig. 6b). Moreover, the
effect of HSP-16.1 is fully dependent on PMR-1 (Fig. 6b).

What is the molecular basis of the protective function of HSP-16.1 in
the Golgi? sHSPs assemble into oligomeric complexes of variable
stoichiometry and serve as molecular chaperones, efficiently binding
denatured proteins and/or preventing irreversible protein aggregation
and insolubilization. Importantly, mammalian PMR1 is selectively
impaired during ischaemic or reperfusion brain injury. We propose
that HSP-16.1 helps to stabilize and protect the stress-labile
PMR-1 pump, allowing for efficient clearance of Ca²⁺ from the
cytoplasm after necrotic insults (Supplementary Fig. 22a-d).

Figure 6 | HSP-16.1 requires PMR-1 to suppress heat-stroke-induced
necrosis and cytoplasmic Ca²⁺ overload. a, Survival of wild-type animals or
pmr-1(tm1840) mutants overexpressing hsp-16.1, under heat stroke (n = 400
animals per assay; P < 0.001 for Ex[Plet-858HSP-16.1] versus
pmr-1(tm1840);Ex[Plet-858HSP-16.1]; unpaired t-test; ***P < 0.001). b, Fluorescence
intensity of touch receptor neurons expressing GCaMP2.0, in wild-type animals
overexpressing nucb-1, pmr-1 and hsp-16.1, or in
pmr-1(tm1840) mutants overexpressing hsp-16.1 under normal conditions
(untreated), and after heat stroke (n = 250 animals per assay; P < 0.001 for
wild-type untreated versus wild-type heat-stroked; P > 0.05 for Ex[Plet-858NUCB-1],
Ex[Plet-858PMR-1], Ex[Pmec-17HSP-16.1] and pmr-1(tm1840);Ex[Pmec-17HSP-16.1]
untreated versus heat-stroked; two-way ANOVA). Error bars,
mean ± s.e.m. Representative images of neuron cell bodies are depicted above
each strain, for each condition. Scale bar, 10 μm. c, Schematic representation of
the protective pathway by which preconditioning augments resistance to
heat-stroke-induced necrotic cell death. Arrows denote activation or upregulation.
Blunt-ended arrow denotes inhibition.

Our study reveals a novel protective mechanism operating specifically
in the Golgi to defend against a wide range of necrosis initiators.
Given the strong evolutionary conservation of the proteins involved,
this mechanism is probably relevant to related human pathologies.
Indeed, noxious heat disrupts cytoplasmic Ca²⁺ absorption
by subcellular organelles. In addition, malignant hyperthermia
susceptibility has been linked to mutations that cause resting Ca²⁺
leakage to the cytoplasm through RyR. Therefore, our findings could
facilitate identification of candidate common intervention targets, in
the effort to battle necrosis-related disorders in humans.


METHODS SUMMARY 


Nematode heat stroke and preconditioning assays. For heat stroke assays,
synchronized populations of animals were grown under optimal growth
conditions at 20 °C to young adult stage (early egg-laying). Synchronized populations
were generated by transferring 20 to 30 gravid adult animals onto fresh plates,
letting them lay eggs for 1.5 h and then removing the adults. Animals were
collected with M9 in 1.5-ml Eppendorf tubes and incubated in 200 μl M9 buffer,
at 39 °C for 15 min, in a water bath. Untreated worms were kept at 20 °C for the
same time period. Survival rates were examined approximately 16 to 18 h after
heat treatment. Animals that either moved freely or responded to tapping of the
plate by moving the head or tail region of the body were considered survivors. To
test whether prior activation of the heat-shock response (preconditioning) leads
to increased survival against heat stroke, synchronized animals (prepared as
described above) were incubated at 34 °C for 30 min, followed by a 20-min recovery
incubation at 20 °C and then incubation at 39 °C for 15 min. Control animals
were kept at 20 °C after the recovery incubation. Survival rates were examined
approximately 16 to 18 h after heat treatment at 39 °C. Survivors were scored as
above.
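
The two treatment schedules described above can be written down as a short timeline, which makes the step order and durations easy to check. The following is a minimal sketch rather than anything used by the authors: the `Step` type and the helper names are hypothetical, and only the temperatures and durations are taken from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    """One incubation step (hypothetical helper type, not from the paper)."""
    label: str
    temp_c: float   # water-bath temperature in degrees Celsius
    minutes: float  # duration of the step


# Temperatures and durations as stated in the Methods Summary.
HEAT_STROKE = [Step("heat stroke", 39, 15)]
PRECONDITIONED = [
    Step("preconditioning", 34, 30),
    Step("recovery", 20, 20),
    Step("heat stroke", 39, 15),
]


def total_minutes(protocol):
    """Total incubation time of a protocol, in minutes."""
    return sum(step.minutes for step in protocol)


def start_times(protocol):
    """Start time of each step, in minutes from the beginning of the protocol."""
    starts, elapsed = [], 0
    for step in protocol:
        starts.append(elapsed)
        elapsed += step.minutes
    return starts


def untreated_control(protocol, rearing_temp_c=20):
    """Same durations, but every step is held at the rearing temperature."""
    return [Step(s.label + " (control)", rearing_temp_c, s.minutes) for s in protocol]


if __name__ == "__main__":
    for step, start in zip(PRECONDITIONED, start_times(PRECONDITIONED)):
        print(f"t = {start} min: {step.label} at {step.temp_c} °C for {step.minutes} min")
    print("total:", total_minutes(PRECONDITIONED), "min")
```

Survival is scored 16 to 18 h after the 39 °C step in either schedule, so the scoring window is deliberately left out of the timeline.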


Full Methods and any associated references are available in the online version of
the paper.

METHODS


Strains and genetics. We followed standard procedures for C. elegans strain
maintenance. The nematode rearing temperature was kept at 20 °C, unless
noted otherwise. The following strains were used in this study: N2: wild-type
Bristol isolate, MT1522: ced-3(n717)IV, KJ216: crt-1(jh101)V, clp-1(tm690)III,
RB2035: asp-4(ok2693)X, TU1747: deg-3(u662)V (referred to in the text as
deg-3(d)), mec-4(u231)X (referred to in the text as mec-4(d)), PS3551: hsf-1(sy441)I,
RB791: hsp-16.1(ok577)V, VC475: hsp-16.2(gk249)V, VC1099: hsp-4(gk514)II,
RB1104: hsp-3(ok1083)X, hsp-6(tm515)V, hsp-16.41(tm1093)V, hsp-70(tm2318)I,
VC914: daf-21(ok1333)V, VC1348: dnj-19(gk649)V, NL4266: nucb-1(pk1654)X,
CB1370: daf-2(e1370)III, DR26: daf-16(m26)I, daf-16(m26)I;daf-2(e1370)III,
ZG31: hif-1(ia4)V, SJ17: xbp-1(zc12)III, SJ4005: N2;Is[Phsp-4GFP]V, SJ4058:
N2;Is[Phsp-60GFP]V, EU1: skn-1(zu67)IV/nT1(IV;V), EU31: skn-1(zu135)IV/
nT1(IV;V), EU40: skn-1(zu129)IV/nT1(IV;V), pmr-1(tm1840)I, pmr-1(tm1750)I,
CF1824: muEx265 [pAL9(Phsf-1HSF-1) pPD97.93(Pmyo-3GFP)]. The following
strains were examined for neurodegeneration in this study:
hsf-1(sy441)I;mec-4(u231)X, hsp-16.1(ok577)V;mec-4(u231)X, mec-4(u231)X;Ex[Phsf-1HSF-1],
mec-4(u231)X;Ex[Pmec-17HSP-16.1], mec-4(u231)X;Ex[Pmec-17NUCB-1],
mec-4(u231)X;Is[Pmec-4GFP], pmr-1(tm1840)I;mec-4(u231)X. To assay the expression
of aspartyl proteases after heat stroke we examined the N2;Ex[Pasp-4ASP-4::GFP]
transgenic animals. To verify that preconditioning does not interfere with the
expression or localization and stability of the toxic MEC-4 channel, we examined
the following transgenic strains: N2;Is[Pmec-4GFP] and N2;Is[Pmec-4MEC-4::GFP].
The following strains were examined in co-localization experiments:
N2;Ex[Pmec-17HSP-16.1::DsRed;Prab-3mannosidase II::GFP],
N2;Ex[Pmec-17HSP-16.1::GFP;Prab-3mCherry::APT-9], N2;Ex[Pmec-17HSP-16.1::DsRed;Prab-32xfyve
domain::GFP] and N2;Ex[Pmec-17PMR-1::DsRed;Pmec-17HSP-16.1::GFP]. To assay
the localization of DAF-16 and SKN-1 after heat preconditioning, we examined
CF1139: daf-16(mu86)I;Is[Pdaf-16DAF-16a::GFP] and LD1: N2;Is[Pskn-1SKN-1b/c::GFP]
transgenic animals. To investigate the localization of HSP-16.41 in
neurons, we examined N2;Ex[Pmec-17HSP-16.41::GFP] transgenic animals. To
examine the localization of HSP-16.1 in hsf-1(sy441)I and daf-2(e1370)III mutants
and muEx265 [pAL9(Phsf-1HSF-1) pPD97.93(Pmyo-3GFP)] transgenic animals,
the following strains were used: hsf-1(sy441)I;Ex[Pmec-17HSP-16.1::GFP],
daf-2(e1370)III;Ex[Pmec-17HSP-16.1::GFP] and muEx265 [pAL9(Phsf-1HSF-1);
Pmec-17HSP-16.1::GFP]. To monitor autophagy induction after heat preconditioning
and heat stroke, we used the QU1: N2;Ex[Plgg-1GFP::LGG-1] strain. The
following strains were used for monitoring calcium levels in touch receptor
neurons: N2;Ex[Pmec-17GCaMP2.0], N2;Ex[Pmec-17GCaMP2.0;Pmec-17HSP-16.1],
N2;Ex[Pmec-17GCaMP2.0;Plet-858NUCB-1], N2;Ex[Pmec-17GCaMP2.0;Plet-858PMR-1],
pmr-1(tm1840)I;Ex[Pmec-17GCaMP2.0], pmr-1(tm1840)I;Ex[Pmec-17GCaMP2.0;
Pmec-17HSP-16.1], mec-4(u231)X;Ex[Pmec-17GCaMP2.0], mec-4(u231)X;Ex[Pmec-17GCaMP2.0;
Pmec-17HSP-16.1]. To investigate the protective role of global expression of
HSP-16.1 against heat stroke, we examined N2;Ex[Plet-858HSP-16.1], hsp-16.1(ok577);
Ex[Plet-858HSP-16.1], hsf-1(sy441);Ex[Plet-858HSP-16.1], Is[Pdat-1GFP];Ex[Plet-858HSP-16.1]
and pmr-1(tm1840)I;Ex[Plet-858HSP-16.1] transgenic animals. To investigate the
role of global expression of nucb-1 against heat stroke, we examined
N2;Ex[Plet-858NUCB-1] transgenic animals. To investigate the protective role of global
expression of pmr-1 against heat stroke, we examined N2;Ex[Plet-858PMR-1],
hsf-1(sy441);Ex[Plet-858PMR-1] and hsp-16.1(ok577);Ex[Plet-858PMR-1]. To assay the
protective effect of global expression of hsp-16.41 and dnj-19 against heat stroke, we
examined N2;Ex[Plet-858HSP-16.41] and N2;Ex[Plet-858DNJ-19] transgenic animals.
To survey the protective role of preconditioning in a C. elegans model of Parkinson's
disease, we tested the UA44 Is[baIn1; Pdat-1α-syn, Pdat-1GFP] transgenic animals
expressing α-synuclein in nematode dopaminergic neurons.

Molecular cloning. For Pmec-17HSP-16.1, an XmaI-KpnI fragment containing the
hsp-16.1 coding region plus 536 base pairs (bp) of the 3' untranslated region
(UTR) was amplified from C. elegans genomic DNA using the primers
5'-CCCGGGATGTCACTTTACCACTATTTCCG-3' and
5'-GGTACCTATCCATGTTCCAATTCCTGC-3' and was cloned into the corresponding sites of
Pmec-17GFP. Construction of Pmec-17GFP has been described previously. The
Pmec-17HSP-16.1 construct was injected into the gonads of mec-4(u231) animals
together with a plasmid that carries a Pmyo-2GFP reporter fusion, expressing GFP
in the pharyngeal muscle cells, as a transformation marker. To generate the
Pmec-17HSP-16.1::GFP reporter construct, we fused a SmaI-AgeI fragment
containing the coding sequence of hsp-16.1, amplified from C. elegans genomic DNA
using the primers 5'-CCCGGGATGTCACTTTACCACTATTTCCG-3' and
5'-ACCGGTCCTTCAGAAGTTTTTTGTTCAACG-3', at the amino terminus
of GFP of the Pmec-17GFP. The translational Pmec-17HSP-16.1::GFP fusion
construct was co-injected with pRF4 (contains rol-6(su1006)) into the gonads of
wild-type animals. To generate the Plet-858HSP-16.1 construct, a KpnI-XmaI
fragment containing the hsp-16.1 coding region plus approximately 500 bp of
3' UTR was amplified from C. elegans genomic DNA using the primers
5'-GGTACCATGTCACTTTACCACTATTTCCG-3' and
5'-CCCGGGTATCCATGTTCCAATTCCTGC-3' and was inserted downstream of the let-858
promoter of the L2865 plasmid vector. The Plet-858HSP-16.1 construct was injected
into the gonads of wild-type animals together with a plasmid carrying the reporter
Punc-122GFP, which stains the coelomocytes of C. elegans. To generate the
Pmec-17GCaMP2.0 construct, the primers 5'-CTGCAGAGCAAAGACGGCAAGAACTG-3'
and 5'-CCCGGGGATCGAATCGTCTCACAAC-3' were used to
amplify a PstI-XmaI fragment containing the promoter of mec-17 (1643 bp) from
C. elegans genomic DNA. In addition, the primers
5'-CCCGGGATGCGGGGTTCTCATCATC-3' and 5'-GATATCTCACTTCGCTGTCATCATTTG-3'
were used to amplify an XmaI-EcoRV fragment containing the GCaMP2.0 coding
region from the pN1-GCaMP2.0 plasmid. The two amplified fragments
were cloned into the pPD49.26 vector. The resulting plasmid construct was
injected into the gonads of wild-type animals together with the reporter
Punc-122GFP. To generate the Plet-858HSP-16.1::GFP construct, the primers
5'-ACCGGTATGTCACTTTACCACTATTTCCG-3' and
5'-ACCGGTCCTTCAGAAGTTTTTTGTTCAACG-3' were used to amplify an AgeI-flanked
fragment containing the coding sequence of hsp-16.1, which was then inserted
downstream of the let-858 promoter in L3786 plasmid. The reporter construct was
injected into the gonads of wild-type animals together with pRF4. To generate the
Pmec-17NUCB-1 construct, the primers 5'-GGATCCATGATTAAGCCTCTAGTC-3'
and 5'-CCATGGTAGATTGGTGAGGTAGTG-3' were used to amplify
a BamHI-NcoI fragment containing the coding sequence of nucb-1, which was
inserted downstream of the promoter of mec-17 in the Pmec-17/pPD95.77
construct. The resulting construct was injected into the gonads of mec-4(d) animals
together with the reporter Punc-122GFP. To generate the Plet-858NUCB-1
construct, the primers 5'-TCTAGAATGATTAAGCCTCTAGTC-3' and
5'-GGGCCCTAGATTGGTGAGGTAGTG-3' were used to amplify an XbaI-ApaI
fragment containing the coding sequence of nucb-1, which was inserted downstream
of the let-858 promoter of the L2865 plasmid vector. The Plet-858NUCB-1
construct was co-injected with pRF4 into the gonads of wild-type animals. To
generate the Plet-858PMR-1 construct, the primers
5'-GCGGGGTACCATGAATGGATCAGTGGCTCC-3' and 5'-TTATGGGTATCCCGTATGG-3' were
used to amplify a KpnI fragment containing the pmr-1 complementary DNA,
which was inserted downstream of the let-858 promoter of the L2865 plasmid
vector. The Plet-858PMR-1 construct was co-injected with pRF4 into the gonads
of wild-type animals. To generate the Pmec-17PMR-1::DsRed reporter construct,
the primers 5'-GCTCCCCCCGGGATGATTGAAACACTGACATC-3' and
5'-CCCTAGACCGGTCCAATTCCGCTGACAGACAGGG-3' were used to
amplify an XmaI-AgeI fragment containing the pmr-1 cDNA, which was inserted
at the N terminus of DsRed of the Pmec-17DsRed vector. The Pmec-17PMR-1::DsRed
construct was co-injected with pRF4 into the gonads of wild-type animals. To
generate the Plet-858HSP-16.41 construct, an XmaI-ApaI fragment containing the
hsp-16.41 coding region plus approximately 500 bp of the 3' UTR was amplified
from C. elegans genomic DNA using the primers
5'-CCCGGGATGCTCATGCTCCGTTCTC-3' and 5'-GGGCCCAGAAGTTTGGAGTCAGGAGG-3'
and was inserted downstream of the let-858 promoter of the L2865 plasmid
vector. The Plet-858HSP-16.41 construct was injected into the gonads of wild-type
animals together with pRF4. To generate the Plet-858DNJ-19 construct, an
XmaI-ApaI fragment containing the dnj-19 coding region plus approximately 500 bp
of the 3' UTR was amplified from C. elegans genomic DNA using the primers
5'-CCCGGGATGTTTGGAGGTGGAAGTAG-3' and
5'-GGGCCCCATGTGTGCAGTATAATGTG-3' and was inserted downstream of the let-858
promoter of the L2865 plasmid vector. The Plet-858DNJ-19 construct was injected
into the gonads of wild-type animals together with pRF4. To generate the
Pmec-17HSP-16.41::GFP reporter construct, we fused an XmaI-AgeI fragment
containing the coding sequence of hsp-16.41, amplified from C. elegans genomic
DNA using the primers 5'-CCCGGGATGCTCATGCTCCGTTCTC-3' and
5'-ACCGGTCCATGTTTTGCAACAAAATTG-3', at the N terminus of GFP of
the Pmec-17GFP plasmid. The translational Pmec-17HSP-16.41::GFP fusion
construct was co-injected with pRF4 into the gonads of wild-type animals. For
engineering the sca-1, itr-1 and pmr-1 RNAi constructs, gene-specific fragments
of interest were obtained by polymerase chain reaction (PCR) amplification
directly from C. elegans genomic DNA using the sets of primers
5'-GCGGCCGCTAAGGAACTCGTGCCAGGAG-3' and
5'-GTCGACACTTGGCGACAGCAGTTCC-3', 5'-GAATTCAGCCCAATGTCCGCAATCC-3' and
5'-GAATTCACACTCAGCGACCCGATACC-3', 5'-AACTGCAGATTGAAACACTGACATC-3'
and 5'-CCGCTCGAGTACCTGAAACATTCCG-3', respectively.
The PCR-generated fragments were subcloned into the pL4440 plasmid vector
and resulting constructs were transformed into HT115(DE3) Escherichia coli
bacteria deficient for RNase III. Bacteria carrying an empty vector were used in
control experiments. For sca-1 and itr-1 RNAi the effect can be severe, leading to
sterility of the animals. Shorter incubation times during HT115 culture
preparation might be required.
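
Because each construct above relies on restriction sites carried at the 5' end of the primers, a quick sanity check is to confirm that a primer contains the expected recognition sequence near its 5' end. The sketch below is illustrative only and is not part of the published methods: the function name `site_offset` and the `max_offset` threshold are assumptions, the recognition sequences are the standard ones for the enzymes named in the text, and the primer sequences are copied from the paragraph above.

```python
# Standard recognition sequences for restriction enzymes named in the text.
SITES = {
    "XmaI": "CCCGGG",
    "SmaI": "CCCGGG",
    "KpnI": "GGTACC",
    "AgeI": "ACCGGT",
    "PstI": "CTGCAG",
    "EcoRV": "GATATC",
    "BamHI": "GGATCC",
    "NcoI": "CCATGG",
    "XbaI": "TCTAGA",
    "ApaI": "GGGCCC",
}


def site_offset(primer, enzyme, max_offset=8):
    """Return the 0-based position of the first occurrence of the enzyme's
    site if it lies within `max_offset` bases of the 5' end, otherwise None.

    `max_offset` is an arbitrary allowance for a short 5' clamp sequence.
    """
    position = primer.upper().find(SITES[enzyme])
    if position == -1 or position > max_offset:
        return None
    return position


# Primer sequences as printed in the Molecular cloning paragraph.
PRIMERS = [
    ("Pmec-17HSP-16.1 forward", "XmaI", "CCCGGGATGTCACTTTACCACTATTTCCG"),
    ("Pmec-17HSP-16.1 reverse", "KpnI", "GGTACCTATCCATGTTCCAATTCCTGC"),
    ("Pmec-17PMR-1::DsRed forward", "XmaI", "GCTCCCCCCGGGATGATTGAAACACTGACATC"),
    ("Pmec-17PMR-1::DsRed reverse", "AgeI", "CCCTAGACCGGTCCAATTCCGCTGACAGACAGGG"),
]

if __name__ == "__main__":
    for name, enzyme, sequence in PRIMERS:
        print(name, enzyme, site_offset(sequence, enzyme))
```

A result of `None` means only that the site was not found near the 5' end of the sequence as transcribed; it does not by itself say anything about the cloning strategy.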

C. elegans cell-death assays. To assay the protective role of preconditioning 
against necrosis triggered by hyperactive channels, C. elegans animals were grown 
under optimal growth conditions at 20°C, to gravid adult stage. Animals were 
collected with M9 in 1.5-ml eppendorf tubes. After a brief centrifugation at 
10,000g, M9 was removed and 0.5 ml of bleaching solution (7 ml HO, 1 ml 
NaOH 5 N and 2 ml bleach) was added. Animals were allowed to dissolve and 
the remaining eggs were collected by centrifugation for 25s at 10,000g and 
washed twice with 1ml M9. After washing, eggs were resuspended in 200 pl 
M9 and incubated for 25 min at 34 °C, in a water bath. Alternatively, eggs were 
subjected 3 times to 8-min preconditioning periods with 45 min periods of 
recovery twice between heat-shock sessions. Control animals were maintained 
at 20°C for the duration of preconditioning. Control and preconditioned eggs 
were placed on nematode growth medium (NGM) plates and incubated at 20 °C 
until hatching. Animals were mounted in 2% agarose pads, anaesthetized with 
10 mM sodium azide and observed using differential interference contrast (DIC) 
microscopy. To test whether preconditioning perturbs the function of MEC-4, 
wild-type animals were preconditioned as described previously, whereas the 
control population was maintained at 20 °C for the course of preconditioning. 
Preconditioned and control animals were assessed for responsiveness to gentle 
body touch, at the L4 stage. Animals received five sequential gentle touches with 
an eyelash in the anterior and posterior part of the body, and their responsiveness 
(indicated by the change in the direction of movement) was recorded. For time- 
course analysis of necrotic cell death, synchronized animal populations were 
obtained by collecting embryos from gravid adults after treatment with bleaching 
solution. These populations were divided in two. One subpopulation was pre- 
conditioned and the other was maintained at 20 °C for the duration of precondi- 
tioning. In each experiment, neuron corpses were counted in each subpopulation 
at a specific developmental stage. Developmental stages from L1 to L4 were 
examined. The total number of corpses was calculated as the sum of three counts 
for each subpopulation and developmental stage. To assess the effects of precon- 
ditioning in a C. elegans model of Parkinson’s disease, NGM plates containing L4 
staged animals were preconditioned daily for 30 or 60 min by immersing the plate 
in a water bath set at 34 °C. Dopaminergic neuron loss was monitored on the fifth 
day of adulthood by scoring for loss of fluorescence from neurons expressing GFP 
under the promoter of dat-1. Survival after heat stroke of dopaminergic neurons 
of Is[pdar-1GFP];Ex[Pier-sssHSP-16.1] young adult transgenic animals was 
assessed 5 h after the assay. Neurons showing soma or axonal degeneration were 
considered dead. For propidium iodide staining (a fluorescent dye excluded from 
viable cells and apoptotic corpses), young adult worms were incubated for 3 h in 
10 µM propidium iodide (Sigma) in M9 buffer, after incubation at 39 °C for
15 min, and visualized using a compound epifluorescence microscope. For
monitoring of induction of expression of the asp-4 gene, young adult worms 
carrying the asp-4::GFP reporter fusion were incubated at 39 °C for 15 min and 
the expression of GFP in vacuolated cells was monitored after 3 h using a
compound epifluorescence microscope.

C. elegans survival assays. To investigate the protective role of preconditioning 
against hypoxic conditions, wild-type nematodes at the L4 stage of development 
were collected and washed 2 to 3 times with 1 ml M9, preconditioned for 15 min at 
34 °C and left to recover for 10 min. Control animals were maintained at 20 °C for 
the duration of preconditioning. To induce hypoxia, control and preconditioned 
animals were incubated for 1 h at 25 °C, in 1 ml of 0.5 M freshly made NaN₃
(Sigma) in M9. Sodium azide is an inhibitor of the respiratory chain electron 
transport complex IV (cytochrome c oxidase) and simulates hypoxia. Worms were 
washed 3 times with 1 ml M9 and placed into NGM plates to recover. The per- 
centage of living worms was calculated after 12 to 16 h of recovery. For pharmaco- 
logical assays, starting at hatching, worms were grown on NGM plates containing 
10 mM EGTA or 10 µM dantrolene, and survival after heat stroke was tested at the
young adult stage. For time-course analysis of heat-stroke survivors either without 
or after heat preconditioning, animals were scored over time for free locomotion or 
movement of the head or tail region of the body after gentle tapping of the plate. 
Animals positive for any of the above criteria were considered as alive. 

Ca²⁺ monitoring. For intracellular Ca²⁺ monitoring experiments, transgenic
animals expressing the Ca²⁺ reporter GCaMP2.0 in the six touch receptor
neurons were examined under a Zeiss AxioImager Z2 epifluorescence microscope.
For heat-stroke-induced neurodegeneration, heat-treated and preconditioned
animals were monitored 3 h after the corresponding treatments. For
necrosis triggered by the toxic MEC-4(d) channel, early hatched L1 larvae were 
monitored. Only neurons of very initial stages of degeneration (based on 
morphological features using DIC microscopy) were used for analysis, as the 
expression of GCaMP2.0 ceases during later stages of neurodegeneration. 
Images of neuron cell bodies were acquired under the same exposure. The
emission intensity of GCaMP2.0 was calculated by using the ImageJ software
(http://rsb.info.nih.gov/ij/).

Electron microscopy. High-pressure freezing (HPF) was carried out as described 
previously and 50 nm sections were analysed. Micrographs were taken with a
1,024 × 1,024-pixel charge-coupled device detector (Proscan CCD HSS 512/1024;
Proscan Electronic Systems) in an electron microscope (EM902A; Carl
Zeiss) operated in the bright field mode.

Messenger RNA quantification. To quantify hsp-16.1, hsp-16.41, hsp-70, dnj-19 
and pmr-1 mRNA levels, total RNA was extracted using the TRIzol reagent 
(Invitrogen). The following sets of primers were used respectively:
5'-GGCTCTCCATCTGAATCTTCTGAG-3' and 5'-TTCAAATCTTCTGGCTTGAACTGC-3',
5'-TCATGCTCCGTTCTCCATATTC-3' and 5'-ATTCATCATTTACAATCTCCCC-3',
5'-AGCTCTGTGCTGATCTTTTCCG-3' and 5'-ATGGAGCAGTTGAGGTCCTTCC-3',
5'-AGATGTTATCAAGCCAGGAGTC-3' and 5'-AGCAGCTTTTCAGAACGGCAT-3',
5'-ACGTGGAGAGAGTATGCAATC-3' and 5'-TACCGAGGGATTGACCAATGC-3'.
Results were normalized to genomic DNA using the following primers specific
for ama-1: 5'-CCTACCTACACTCCAAGTCCATCG-3' or 5'-GGTGAAGCTGGCGAATACGTTG-3'.
For cDNA synthesis, mRNA was reverse transcribed using the iScript
cDNA Synthesis Kit (BioRad). Quantitative PCR was performed in triplicate
using the Bio-Rad CFX96 Real-Time PCR system (BioRad).

Neuronal cultures. ESC-derived neurons were obtained from the differentiation 
of J1 ESCs following previously described protocols. For primary cortical
cultures, the cortex was dissected from E16 mouse embryos in PBS (137 mM NaCl,
2.7 mM KCl, 8 mM Na₂HPO₄ and 1.46 mM KH₂PO₄) containing glucose (0.2%)
and BSA (0.1%), and was trypsinized for 15 min in 0.5% trypsin solution at 37 °C,
followed by mechanical dissociation with a glass capillary. Neurons were plated at
an initial density of 200,000 cells per cm² and cultured in neurobasal medium
supplemented with 2% B27, 200 µM glutamine, 5 µg µl⁻¹ penicillin and
12.5 µg µl⁻¹ streptomycin. For some experiments, ESC-derived and cortical
neurons were infected with lentiviruses 24 h after plating. For Pmr1 knockdown,
functionally validated shRNAs against PMR1 (Invitrogen, previously described
in ref. 56) were electroporated into ESCs. ESCs stably expressing the shRNAs
were selected using 240 µg ml⁻¹ G418, and single clones were isolated and
differentiated into neurons as described earlier.

Mammalian neuron heat stroke and preconditioning assays. Preconditioning 
of neuronal cultures was carried out by placing them at 39 °C for 1 h in a CO₂
incubator, and cultures were placed back to 37 °C for 1 h to recover before
proceeding to heat stroke. Heat stroke was simulated by exposing the neuronal
cultures to 42 °C for 20 min in a CO₂ incubator. For the duration of the assays,
the level of CO₂ was kept constant at 5%, similar to normal culturing conditions.
After heat stroke, cells were returned to 37 °C for 15 min to recover and during
this period propidium iodide (3 µg ml⁻¹) was added to the medium to stain
necrotic neurons.

Production of lentiviruses. A modified LenLox3.7 plasmid (pLV) containing the 
human synapsin (hSYN) promoter was used as a control. Cryaa cDNA was
cloned after the hSYN promoter of pLV using BamHI and NotI sites. Lentiviruses
were produced in HEK293 cells triple-transfected transiently with pVSV-G and 
pA8.1 vectors (gift from G. Sourvinos) and with either the empty pLV-hSYN or 
pLV-hSYN-Cryaa plasmids. The supernatant containing the viral particles was 
collected 2 to 3 days post transfection and titre was determined by serial dilutions. 

Pmr1 knockdown. For Pmr1 downregulation, shRNAs against mouse Pmr1 were
designed by modifying validated oligonucleotides previously used to downregu- 
late the highly homologous human Pmr1 (described in ref. 56). The resulting 
oligonucleotides that were used in our study are the following:
5'-CGTCAAGAAGTAACATCGCCTTCA-3' and 5'-CAGTGGATAAAGATGTCATTCGAAA-3'.
A short interfering RNA (siRNA) expression vector containing a
neomycin resistance cassette (Genscript, catalogue no. SD1100) was used to
express the shRNAs under the H1 promoter. The shRNA inserts were built using 
the above oligonucleotides in a sense-loop-antisense-termination sequence 
flanked by BamHI and HindIII sites on the 5’ and 3’ ends, respectively. The loop 
sequence used was TTCAAGAGA and the termination signal TTTTTT. 
Therefore, the sequences of the shRNA inserts are as follows: insert 1: 
5'-GGATCCCCGTCAAGAAGTAACATCGCCTTCATTCAAGAGATGAAGGCGATGTTACTTCTTGACGTTTTTTCCAAAGCTT-3',
and insert 2:
5'-GGATCCCCAGTGGATAAAGATGTCATTCGAAATTCAAGAGATTTCGAATGACATCTTTATCCACTGTTTTTTCCAAACGTT-3'.
The shRNA inserts were
cloned into the vector using BamHI and HindIII sites, also present as unique 
sites in the vector. Plasmids containing the shRNAs were linearized using NotI,
phenol-chloroform extracted and twice precipitated with ethanol, and
electroporated into ESCs. Electroporation was carried out using an ECM830 square
wave BTX electroporator (set at 320 V, 1 pulse, 99 µs). Electroporated ESCs were
plated on neomycin-resistant feeders and were treated with 240 µg ml⁻¹ G418,
starting 24 h after electroporation and for 7 days. Neomycin-resistant ESCs grew
as separate colonies on the feeders. Single ESC colonies were picked mechanically, each
plated in a well of a 96-well plate containing feeders and expanded in the presence 
of G418. Thereafter, four colonies were further differentiated into neurons and 
levels of Pmr1 were analysed by Western Blot analysis of lysates obtained from 
these neurons. 
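
The sense-loop-antisense-termination layout described above can be checked with a short sketch. It is illustrative only: the helper name `build_shrna_insert` is hypothetical, and the spacer bases (a single C after the BamHI site and CCA before the HindIII site) are an assumption inferred from the printed insert 1 rather than a stated design rule.

```python
# Sketch: assemble a sense-loop-antisense-termination shRNA insert.
# Assumption: spacer bases ("C" after BamHI, "CCA" before HindIII) are
# inferred from the printed insert 1; they are not stated in the text.

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

BAMHI = "GGATCC"        # 5' flanking restriction site
HINDIII = "AAGCTT"      # 3' flanking restriction site
LOOP = "TTCAAGAGA"      # loop sequence given in the text
TERMINATOR = "TTTTTT"   # termination signal given in the text


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def build_shrna_insert(sense: str) -> str:
    """Hypothetical helper: BamHI-sense-loop-antisense-terminator-HindIII."""
    return (BAMHI + "C" + sense + LOOP + reverse_complement(sense)
            + TERMINATOR + "CCA" + HINDIII)


insert_1 = build_shrna_insert("CGTCAAGAAGTAACATCGCCTTCA")
insert_2 = build_shrna_insert("CAGTGGATAAAGATGTCATTCGAAA")
print(insert_1)
print(insert_2)
```

With the first oligonucleotide this construction reproduces the printed insert 1. For the second, it ends in the HindIII site AAGCTT, whereas the printed insert 2 ends in AACGTT, which may be worth checking against the original construct.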

Western blot analysis. Cells were lysed in 500 mM Tris-HCl pH 7.2, containing
1 M NaCl, EDTA, Triton X-100, Na-deoxycholate, 10% SDS, supplemented with
1 mM DTT (dithiothreitol) and protease inhibitors (Roche) and placed for 20 min
on ice followed by a 20-min centrifugation at 10,000g, at 4 °C. Supernatants were
separated on an 8% acrylamide gel and transferred to a nitrocellulose membrane.
After overnight blocking with 5% skimmed milk in TPBS (PBS with 0.1% Tween-20)
at 4 °C, membranes were incubated overnight with primary antibodies prepared
in blocking solution, at 4 °C. After several washes with TPBS, membranes
were incubated for 1 h with secondary, horseradish peroxidase (HRP)-conjugated
antibodies, at room temperature. After several washes with TPBS,
membranes were incubated with enhanced chemiluminescence (ECL) substrate
for 5 min (Thermo Scientific) and developed using the LAS-3000 imaging system.
The primary antibodies used for western blot analysis were anti-PMR1 (gift of
J. Vanoevelen and F. Wuytack, diluted at a ratio of 1:500) and anti-β-III tubulin
(Covance, MMS-435P-250, diluted at a ratio of 1:5000).

Immunocytochemistry. Cells were fixed for 20 min at 37 °C in 4% paraformaldehyde
(PFA). After several washes with PBS, cells were incubated in blocking solution
(10% serum, 0.2% Triton-X in PBS) for 1 h at 25 °C and then overnight at 4 °C in
blocking solution containing primary antibody. After several washes with PBS,
cells were incubated with secondary antibody diluted in PBS. The nuclear dye
Hoechst 33342 (Thermo Scientific) was also added during this incubation period.
The following primary antibodies were used: anti-β-III tubulin (Covance,
MMS-435P-250, diluted at a ratio of 1:2000), anti-crystallin αA (Santa Cruz
sc-22389, 1:500), anti-mannosidase II (gift of S. Mitrovic, diluted at a ratio of
1:2000) and anti-PMR1 (gift of J. Vanoevelen, diluted at a ratio of 1:500). For
the co-labelling of CRYAA and PMR1, an antigen retrieval treatment was necessary.
Briefly, cells were placed in 10 mM sodium citrate and microwaved for 30 s so
that the solution came to boil. Subsequently, they were immediately placed on
ice for 5 min. This
boil-cold cycle was repeated two more times and then the citrate buffer was 
removed and cells were incubated in blocking solution and stained following the 
standard procedure described above. Images were acquired using the Zeiss LSM 
710 confocal microscope. 

Statistical analysis. The Prism software package (GraphPad Software) and the 
Microsoft Office 2003 Excel software package (Microsoft Corporation) were used 
to carry out statistical analyses. Mean values were compared using unpaired 
t-tests. For multiple comparisons, we used one- and two-way ANOVA, corrected 
by the post-hoc Bonferroni test. Information about P values and experiment 
specifics (n, statistical significance tests) for all main figures are provided in their 
respective figure legends. 
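
As a rough illustration of the tests named above, the following sketch computes an unpaired (pooled-variance) t statistic and applies a Bonferroni correction to a set of P values. It is not the Prism or Excel implementation; the function names and all numbers are hypothetical and chosen only to show the arithmetic.

```python
# Sketch of an unpaired t statistic and a Bonferroni correction.
# All data and P values below are hypothetical illustration values.
from math import sqrt
from statistics import mean, variance


def unpaired_t(a, b):
    """Student's t statistic with pooled variance, and its degrees of freedom."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / sqrt(pooled * (1 / na + 1 / nb))
    return t, df


def bonferroni(p_values):
    """Multiply each P value by the number of comparisons, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


t_stat, dof = unpaired_t([1.0, 2.0, 3.0], [2.0, 3.0, 4.0])
adjusted = bonferroni([0.004, 0.020, 0.300])
print(t_stat, dof)
print(adjusted)
```

The correction is deliberately conservative: with three comparisons, a raw P value must fall below 0.05/3 to remain significant at the 0.05 level.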

Retinal waves coordinate patterned activity throughout the developing visual system


James B. Ackman¹, Timothy J. Burbridge¹ & Michael C. Crair¹


The morphological and functional development of the vertebrate nervous system is initially governed by genetic factors 
and subsequently refined by neuronal activity. However, fundamental features of the nervous system emerge before 
sensory experience is possible. Thus, activity-dependent development occurring before the onset of experience must be 
driven by spontaneous activity, but the origin and nature of activity in vivo remains largely untested. Here we use optical 
methods to show in live neonatal mice that waves of spontaneous retinal activity are present and propagate throughout 
the entire visual system before eye opening. This patterned activity encompassed the visual field, relied on cholinergic 
neurotransmission, preferentially initiated in the binocular retina and exhibited spatiotemporal correlations between 
the two hemispheres. Retinal waves were the primary source of activity in the midbrain and primary visual cortex, but 
only modulated ongoing activity in secondary visual areas. Thus, spontaneous retinal activity is transmitted through the 
entire visual system and carries patterned information capable of guiding the activity-dependent development of 
complex intra- and inter-hemispheric circuits before the onset of vision. 


The display of spontaneous activity is an emergent property of the
immature nervous system that is thought to mediate synaptic competition
and instruct self-organization in many developing neural circuits.
If spontaneous activity is structured with respect to the topographic
organization of first-order sensory and motor circuits, then these activity
patterns could provide a template for activity-dependent development of
downstream synaptic connectivity throughout the nervous system.
Whether spontaneous patterned activity exists and is communicated 
through all levels of organization for any sensory system during develop- 
ment is currently unknown. 

In the developing visual system, isolated preparations of retina
exhibit propagating bursts of action potentials among neighbouring
retinal ganglion cells (RGCs), termed 'retinal waves'. As RGCs relay
visual information to higher-order structures in the central nervous
system, retinal waves are thought to have a key role in the
activity-dependent refinement of topographic neural maps in the superior
colliculus, dorsal lateral geniculate nucleus (LGN) and visual cortex,
which exhibit functional connectivity before the onset of visual experience.
However, the role of retinal waves in neural-circuit development
remains controversial, partly because their existence has never been
demonstrated in vivo.

Previous work using extracellular microelectrode recording techniques
in vivo showed limited and local correlated spiking between
pairs of embryonic rat RGCs, but no direct assessment of wave
activity has been conducted in vivo, probably because of the
methodological challenges associated with recording from a large cohort of
RGCs in neonatal animals. In neonatal rat visual cortex, extracellular
recordings show bursts of spiking activity and slow spreading oscillations
that are sensitive to the presence of retinal input in the neocortex
in vivo, but patterned spontaneous activity in and among different
visual areas associated with retinal waves has never been observed. We
sought to establish whether travelling waves of spontaneous retinal
activity occur in neonatal mice in vivo and to determine whether
spontaneous retinal waves convey spatiotemporal patterns suitable for
the activity-dependent refinement of visual maps throughout the nervous
system before the onset of vision.


Spontaneous retinal waves occur in vivo 


The most superficial layer of the mouse superior colliculus, the stratum
griseum superficiale (SGS), receives retinotopically mapped terminal
input from virtually all RGCs and is completely accessible to imaging
during development. We recorded directly from RGCs in vivo by
injecting a calcium indicator, calcium green-1 dextran (CaGr-Dx), into
the retina of neonatal mice and imaging the anterogradely labelled
RGC axon arbors in the superior colliculus between 3 and 9 days after
birth (postnatal day 3 (P3)–P9) through a cranial window using
multiphoton or wide-field fluorescence excitation (Fig. 1). Calcium imaging
of RGC terminals revealed spontaneous waves of activity propagating
throughout the superior colliculus of neonatal mice (Fig. 1d–f and
Supplementary Movie 1). Waves propagated among labelled RGC
terminals at all depths examined within the SGS (range: 76–206 µm
below the pial surface). Wave-front velocity (median ± median absolute
deviation: 44.17 ± 15.99 µm s⁻¹; n = 765 waves) was 3 to 4 times
slower than previously reported wave speeds measured in the mouse
retina in vitro, which is consistent with the diameter of the superior
colliculus being approximately 3 times smaller than that of the retina.
Retinal waves occurred with a mean frequency (0.68 ± 0.08 waves per
min; n = 19 hemispheres) that was sensitive to age (Supplementary
Fig. 1) and similar to the frequency observed in vitro. Surprisingly,
waves in vivo propagated over a much greater proportion of the
superior colliculus area than was expected based on previous in vitro
studies (median wave size ± median absolute deviation: 0.52 ± 0.30 mm²,
corresponding to ~30% of superior colliculus retinotopic area,
compared with an average wave size of ~1% of retina area in vitro)
(Supplementary Fig. 2). Waves were completely abolished after application
of the Na⁺ channel antagonist tetrodotoxin (TTX) into the
contralateral eye (Fig. 1g, h and Supplementary Fig. 3) (n = 2 animals;
region of interest (ROI) active fraction: control, 0.78 ± 0.07; TTX,
0.00 ± 0.00), but not the ipsilateral eye (n = 2 animals, ROI active
fraction: control, 0.89 ± 0.09; TTX, 0.999 ± 0.001).

Figure 1 | Spontaneous waves of activity in retinal ganglion cell arbors in
vivo. a, Calcium-dye labelling and two-photon imaging. b, Superior colliculus
(SC) craniotomy from a P5 mouse overlaid with the corresponding two-photon
excitation image of CaGr-Dx-labelled RGC axon arbors. c, Higher-magnification
image of CaGr-Dx-labelled RGC axons from inset in b, 82 µm
below pial surface. d, Montage (ΔF/F) of a spontaneous wave recorded in RGC
axon arbors (from the same labelled field shown in b). e, Calcium transients
from RGC axon arbors during the wave shown in d. f, Raster plot and activity
histogram from a 677-s recording. Points indicate calcium-transient onsets
from individual ROIs. Same wave from d shown on right at expanded time
scale. g, Dye labelling and wide-field CCD imaging for bilateral recordings.
h, Raster plots from animal shown in g. Intraocular TTX application blocks
waves in the contralateral hemisphere. L-SC, left SC; PMT, photomultiplier
tube; R-eye, right eye; R-SC, right SC; Ti:Sapphire, Ti:Sapphire laser.


Retinal waves propagate to collicular neurons

Most models for activity-dependent neural-circuit development
assume that patterned afferent activity drives target-neuron spiking
in order for Hebbian plasticity rules to strengthen synapses between
co-active neurons. Previous work carried out in an intact mouse
retinogeniculate preparation ex vivo showed that spontaneous retinal
activity can drive dLGN neurons above spike threshold and in vivo
multi-electrode recordings in the ferret showed that dLGN neurons
exhibit correlated episodic bursting that was sensitive to ablation of
retinal input just before eye opening. However, it remains unknown
whether spontaneous retinal waves in vivo can actually drive wave-like
network activity in RGC neural targets, a prerequisite for Hebbian
refinement at developing synapses. We performed cellular-level calcium
imaging in the SGS using bulk loading of the calcium indicator OGB1-AM
(Fig. 2a, b) to determine whether retinal waves trigger travelling
waves in retinorecipient neurons in the superior colliculus.
Bolus-indicator loading in the superficial layer of the superior colliculus
labelled >1,000 neurons within a 550 × 550-µm field of view (Fig. 2b, c). We
first imaged at low power over the network of cells and neuropil in
the same way as for CaGr-Dx imaging (Fig. 1). This revealed propagating
calcium waves travelling within the retinorecipient layer of
the superior colliculus (range: 34–172 µm below the pial surface)
(Fig. 2d–f and Supplementary Movie 2). These waves occurred at the
same frequency (0.69 ± 0.17 waves per min, n = 11 hemispheres) as
the retinal waves recorded presynaptically in RGC axons (P = 0.9497,
t-test) (Supplementary Fig. 1) and activated a similar percentage of the
labelled area in the superior colliculus per recording (CaGr-Dx ROI
active fraction: 0.79 ± 0.06, n = 19 hemispheres; OGB1-AM ROI
active fraction: 0.74 ± 0.08, n = 11 hemispheres; P = 0.6228, t-test).
Waves propagated among OGB1-AM-labelled cells in the superior
colliculus with a median wave-front velocity of 36.11 µm s⁻¹ (median
absolute deviation: 13.00 µm s⁻¹, n = 189 waves), which was similar to
the wave speeds observed with presynaptic recordings (CaGr-Dx
mean: 42.93 ± 1.73 µm s⁻¹, OGB1-AM mean: 43.04 ± 6.68 µm s⁻¹,
P = 0.9881, t-test). Together, these results indicate that the
spatiotemporal properties of presynaptic retinal waves match those of
postsynaptic retinal waves measured in the superior colliculus.
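
Several of the summary values above are reported as median ± median absolute deviation (MAD). A minimal sketch of that summary is given below, assuming the unscaled MAD (the median of absolute deviations from the median); the speeds listed are hypothetical illustration values, not measurements from this study.

```python
# Sketch: median and (unscaled) median absolute deviation of wave speeds.
# The speeds below are hypothetical; units would be µm per second.
from statistics import median


def median_and_mad(values):
    """Return (median, median absolute deviation from the median)."""
    centre = median(values)
    deviations = [abs(v - centre) for v in values]
    return centre, median(deviations)


speeds = [30.0, 44.0, 28.0, 60.0, 45.0]
centre, mad = median_and_mad(speeds)
print(f"{centre:.2f} +/- {mad:.2f}")
```

Unlike the mean and s.e.m., this summary is insensitive to a few very fast or very slow waves, which may be why it suits skewed wave-speed distributions.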

We next performed high spatial resolution two-photon imaging on
active regions observed at low magnification. We found that calcium
waves propagated among superior colliculus neurons (Fig. 2g–i)
(fraction of cells active per wave, 0.48 ± 0.07; fraction of cells active
in a wave during a 10-min recording, 0.87 ± 0.08; n = 787 cells,
13 waves, 3 recordings), and that the activity of nearby cells was more
temporally correlated than that of distant cells (Fig. 2i, j and
Supplementary Fig. 5). The travelling waves among cells in the retinal-input
layer of the superior colliculus were abolished when spiking activity was
blocked in the contralateral eye (Fig. 2h and Supplementary Fig. 3)
(n = 6 hemispheres; ROI active fraction: control, 0.80 ± 0.08; TTX,
0.004 ± 0.003) but not the ipsilateral eye (n = 2 animals, ROI active
fraction: control, 1.00 ± 0.00; TTX, 0.998 ± 0.002). These experiments
show that retinal waves drive wave-like activation patterns in
retino-recipient target neurons, indicating that spontaneous activity in
the retina provides a template pattern that is matched in higher-order
circuits in the visual system.
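
The two participation measures quoted above (fraction of cells active per wave, and fraction of cells active in at least one wave during a recording) can be sketched from a wave-by-cell participation matrix. The matrix layout, the function name and the example values are assumptions made for illustration; the analysis code used in the study is not described here.

```python
# Sketch: participation fractions from a hypothetical waves x cells matrix.
# participation[w][c] is 1 if cell c showed a calcium transient in wave w.


def active_fractions(participation):
    """Return (fraction of cells active in each wave,
    fraction of cells active in at least one wave)."""
    n_cells = len(participation[0])
    per_wave = [sum(row) / n_cells for row in participation]
    ever_active = sum(1 for column in zip(*participation) if any(column))
    return per_wave, ever_active / n_cells


example = [
    [1, 0, 1, 0],
    [0, 0, 1, 1],
    [1, 0, 0, 0],
]
per_wave, ever = active_fractions(example)
print(per_wave, ever)
```

The second fraction can only grow as more waves are recorded, which is consistent with it being larger than the per-wave fraction.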


Binocular coordination of retinal waves 


Retinal waves are thought to emerge from recurrent excitatory connections
in networks of spontaneously active amacrine cells and
ganglion cells, and are presumed to originate randomly throughout
the retina. We examined the initiation sites of retinal waves in vivo by
analysing the calcium event response frequencies for all active ROIs at
the onset of waves. Interestingly, we discovered that instead of a
uniform distribution of wave-initiation sites, waves preferentially
nucleated in the rostral-medial superior colliculus (Fig. 3a) (n = 995
waves). Consistent with this preferential site of wave generation,
waves propagated with a marked directional bias towards the
caudal-lateral superior colliculus, equivalent to a directional preference for
the dorsal-nasal pole of the retina (Fig. 3b) (n = 995 waves, Rayleigh test of
uniformity, P = 0). This directional bias was the same for both
presynaptic and postsynaptic recordings (Supplementary Fig. 6), and was
similar to but stronger than the wave-direction bias reported in a recent
in vitro study. The overall preference for waves to propagate towards
the dorsal-nasal retina is aligned with but directly opposite the
direction preference of RGCs and visual cortical neurons at the time of eye
opening, which corresponds to movement towards the ventral-temporal
pole of the retina. It is possible that the pronounced
wave-direction bias we observed mediates the development of aspects of this
directionally sensitive visual circuit. The notable preference for wave
initiation in the rostral-medial superior colliculus (Fig. 3a) indicates a
corresponding site of wave generation in the ventral-temporal retina,
which represents the binocular portion of the visual field in mice. This
suggests that retinal waves may have an enhanced role in mediating
development of binocular aspects of maps in the visual system, such as
the formation of eye-specific connectivity and matched orientation
selectivity between the eyes.
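
The Rayleigh test mentioned above asks whether a set of directions is uniformly distributed around the circle. The sketch below is a generic implementation using a commonly used closed-form approximation for the P value; it is not the analysis code from this study, and the example angles are hypothetical.

```python
# Sketch: Rayleigh test of uniformity for wave directions (in degrees).
# Uses a standard closed-form approximation for the P value; the example
# angles are hypothetical illustration values.
import math


def rayleigh_test(angles_deg):
    """Return (mean resultant length, Rayleigh z, approximate P value)."""
    n = len(angles_deg)
    c = sum(math.cos(math.radians(a)) for a in angles_deg)
    s = sum(math.sin(math.radians(a)) for a in angles_deg)
    resultant = math.hypot(c, s)        # length of the summed unit vectors
    r_bar = resultant / n               # 0 = no bias, 1 = one direction only
    z = n * r_bar ** 2
    p = math.exp(math.sqrt(1 + 4 * n + 4 * (n * n - resultant ** 2))
                 - (1 + 2 * n))
    return r_bar, z, min(1.0, p)


uniform = [0.0, 90.0, 180.0, 270.0]
biased = [40.0, 45.0, 50.0, 42.0, 48.0] * 4
print(rayleigh_test(uniform))
print(rayleigh_test(biased))
```

For large samples with a strong directional bias the approximate P value becomes vanishingly small and can fall below floating-point precision, which may be how a reported value of P = 0 arises.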

As retinal waves arise spontaneously, it is commonly assumed that
waves in each eye are autonomous and drive their corresponding visual
circuits independently, giving rise to a completely asynchronous
activation pattern that could be used for segregation of eye-specific
projections. As unilateral TTX blockade or enucleation abolished
presynaptic and postsynaptic retinal waves (Supplementary Fig. 3),
we anticipated that the retinal waves would occur independently in
the two superior colliculus hemispheres. Unexpectedly, simultaneous
bilateral calcium imaging of the superior colliculus revealed that retinal
waves sometimes travelled with matched spatial patterns in both
hemispheres at the same time (Fig. 3c–f). These bilaterally coordinated
waves were found when recording from presynaptic afferents in the
superior colliculus with CaGr-Dx (Supplementary Movie 3) and from
the postsynaptic networks within the superior colliculus labelled with
OGB1-AM (Fig. 3c and Supplementary Movie 4). Temporally matched
waves occurred throughout the superior colliculus and constituted a
small (15.8%) but significant subset of all waves (P = 0.00098, t-test).
Synchronous waves in the two hemispheres of the superior colliculus
also typically propagated in a highly correlated manner spatially
(85.0% of temporally correlated waves were also spatially correlated,
accounting for 13.4% overall; P = 0.004535, t-test; Fig. 3c–e).
Retinopetal projections as well as retino-retinal connectivity have been
reported in the developing mammalian retina (Supplementary
Fig. 7). Thus, it is possible that descending synchronized inputs or
synaptic interactions between the two retinas may cause a small subset
of retinal waves to initiate at matched retinotopic locations bilaterally.
Because commissural connections between the hemispheres in superior
colliculus and visual cortex are retinotopically coordinated and
orientation selectivity is matched between the eyes in the binocular
visual cortex, even in the absence of visual experience, a possible
function for synchronous retinal waves is to regulate bilateral matching
of visual map connectivity.

Figure 2 | Spontaneous waves of correlated activity among superior-colliculus
neurons in vivo. a, Experimental overview. b, OGB1-AM bulk
loading in the SC at P4. c, Two-photon excitation image of OGB1-AM-labelled
cells from inset in b. Image from 56 µm below pial surface. d, Montage (ΔF/F)
of a spontaneous wave recorded in the SC (from the same labelled field shown
in c). e, Calcium transients in local ROIs during the wave shown in d. f, Raster
plot and activity histogram during a 610-s recording (from the same
experiment as d and e). g, Calcium imaging in single SC cells loaded with
OGB1-AM. h, Single-cell ROIs and calcium transients during a wave. i, Plot of
neurons active during two sequential waves. j, Pairwise cell correlations as a
function of pair distance. Dashed line, the chance distribution. k, Bilateral dye
labelling and example wide-field CCD image. l, Raster plots from a bilateral SC
recording. Intraocular TTX application blocks waves in the contralateral
hemisphere.

Figure 3 | Retinal waves originate in the ventrotemporal retina and
propagate bilaterally. a, Contour map of normalized ROI response
frequencies in the SC within 2 s of wave onset. b, Wind-rose histogram of all
wave directions in SC. c, Montage (ΔF/F) of synchronized retinal waves
recorded in OGB1-AM-labelled cells in both hemispheres of the SC. d, Peri-event
time histograms of calcium-event latencies relative to wave onset for the
same or opposite hemispheres. ROI event latencies: same hemisphere (Same
hemi), n = 103,354; opposite hemisphere (Opp hemi), n = 97,280. e, Fraction
of presynaptic and postsynaptic waves having significant temporal
(P = 0.00098, t-test; CaGr-Dx: 52/332 waves; OGB1-AM: 20/90 waves) and
spatial (P = 0.004535, t-test; CaGr-Dx: 46/332 waves; OGB1-AM: 16/90 waves)
correlations bilaterally. Error bars, mean ± s.e.m. f, Cumulative distributions of
the spatial-similarity metric across hemispheres for observed wave pairs and
random wave pairs. ***P = 6.072e °°, t-test.


Retinal waves propagate to, and within, visual cortex

Visual information from the retina is routed to the primary visual
cortex (V1) through the dLGN of the thalamus. Retinogeniculate
input to the dLGN in mice originates from collateral fibres that arise
from a subset of retinocollicular afferents. Therefore, retinal waves
transmitted to the superior colliculus should also be relayed to the
visual cortex through the thalamus. To test whether retinal waves
induce propagating waves within visual cortex, we carried out
simultaneous wide-field calcium imaging of the superior colliculus and
ipsilateral visual cortex (Fig. 4a–c). We found that travelling waves
in the visual cortex were coincident with retinal waves propagating
across the ipsilateral superior colliculus (Fig. 4d, e) (coactive
fraction, V1/superior colliculus = 50/81 waves, n = 4 hemispheres), and
inter-wave intervals were similar in superior colliculus and V1 (Fig. 4f)
(P6–P9 superior colliculus, 40.53 ± 2.81 s, n = 133 waves; P6–P9 V1,
49.01 ± 4.79 s, n = 62 waves; P = 0.1292, t-test). The retinotopic map
in V1 is mirrored and rotated with respect to the retinotopic map in the
ipsilateral superior colliculus (Supplementary Fig. 8). Notably, after
correcting for the mirrored and rotated maps, the direction of wave
propagation in visual cortex was retinotopically matched to the
direction of wave travel in the superior colliculus (Fig. 4g, h) (r = 0.835,
P = 4.707e ‘*, Pearson's correlation, n = 50 waves). These results
indicate that retinal waves carry functional information corresponding
to retinal organization simultaneously throughout multiple areas of the
visual system during development.

The V1 in mice contains a retinotopic map of the contralateral
visual hemifield and is adjoined by up to nine extrastriate visual areas
homologous to those seen in carnivores and primates, each of which
contains a duplicate map of retinal topography. To gain a better
understanding of the overall contribution of retinal waves to ongoing
patterned activity among developing visual cortical areas in mice, we
used a Cre/loxP approach (Emx1-Cre:Ai38 mice) to express the
genetically encoded calcium sensor GCaMP3 in all excitatory cortical
neurons. Wide-field fluorescent GCaMP3 signals from neocortical
neurons were imaged directly through the skull simultaneously with
OGB1-AM signals through a cranial window over the superior
colliculus (Fig. 5a, b). Retinal waves in the superior colliculus propagated
concurrently with waves in the V1 and separate, secondary wave
fronts were initiated in extrastriate cortical areas (Fig. 5c and
Supplementary Movie 5). We created maps of retinal topography in the
visual cortex and midbrain by colorizing and merging ΔF/F image
frames based on the locations of superior colliculus retinal wave fronts
within 10-min recordings (Fig. 5d). This functional map of wave-based
retinotopy revealed a large primary map in visual cortex that
was mirrored and rotated with respect to the superior colliculus map 
(Fig. 5d) and that was remarkably consistent with the known visual-map 
organization of adult mouse V1 compared with superior colliculus. 
Although the frequency of calcium events was higher in V1 and 
extrastriate cortex relative to superior colliculus (Fig. 5e) (median ± median 
absolute deviation, superior colliculus: 0.010 ± 0.003 Hz; V1: 0.013 ± 
0.003 Hz; extrastriate: 0.017 ± 0.003 Hz), most of the activity in V1 
occurred within seconds of a retinal wave (Fig. 5f). Retinal waves also 
triggered activity in secondary visual cortical areas, but much of the 
ongoing cortical activity in extrastriate areas seemed to be independent 
of retinal waves (Fig. 5e–g), indicating a strong role for intra- or 
subcortical inputs separate from the primary visual pathway in driving 
ongoing activity within developing extrastriate cortex. Ablation of 
contralateral retinal input dramatically diminished spontaneous activity in 
the superior colliculus and V1, but in surrounding extrastriate visual 
areas the frequency of ongoing activity was unaffected although the 
spatial distribution of this activity was shifted noticeably (Fig. 5e, g) 
(median ± median absolute deviation, superior colliculus: 0 ± 0 Hz; 
V1: 0.002 ± 0.002 Hz; extrastriate: 0.017 ± 0.005 Hz and Supplementary 
Fig. 9). These results indicate that retinal waves provide a primary 
source of patterned activity for V1 during early development and 
further modulate activity in extrastriate visual areas. Retinal waves 
convey patterned information suitable for driving activity-dependent 
development of neocortical circuits before the onset of vision and may 
thus prompt coordinated alignment and refinement of topographic 
maps across the visual system. 

Figure 4 | Retinal waves propagate simultaneously in the SC and visual 
cortex. a, Experimental overview. b, Right visual cortex (R-V1) and SC (R-SC) 
craniotomies from a P9 mouse. c, OGB1-AM bulk loading in V1 and SC. 
d, Montage (ΔF/F) of spontaneous waves recorded in SC and V1 
simultaneously. e, Raster plot from the recording in d. f, Box-plots show 
interval between successive wave onsets in SC and V1 between P6 and P9. Box 
edges, first and third quartiles. Horizontal line, median. Whiskers, range. 
g, Activity maps showing the direction of wave propagation for 3 waves in V1 
and SC indicated in e. Top-left panel shows the approximate retinotopy of 
mouse V1 and SC. d, dorsal; n, nasal; t, temporal; v, ventral. h, Plot of SC 
retinal-wave directions versus V1-wave directions normalized to SC coordinates, with 
linear regression (blue line) and 95% confidence intervals (grey shading). 


Retinal waves are abolished by epibatidine 


Classic studies show that pharmacologic manipulation of retinal waves 
in vivo with epibatidine, a high-affinity agonist for nicotinic 
acetylcholine receptors, disrupts visual-map development in ferret and 
mouse. These studies relied on in vitro evidence that epibatidine 
blocks retinal waves, but the effects of epibatidine on wave 
generation in vivo and the link between retinal waves and visual-map 
development remain uncertain. We generated Rx-Cre:Ai38 mice, which express 
the genetically encoded calcium reporter GCaMP3 in both RGCs 
and forebrain neurons, to perform simultaneous recordings of retinal 
waves in RGC axons and the visual cortex. Retinal waves propagated 
simultaneously in superior colliculus and V1 (Fig. 6c, d) and were 
completely abolished after monocular injection into the contralateral 
eye of 1,000 µM epibatidine (Fig. 6e, f) (control, n = 58 movies, 
0.96 ± 0.05 waves per min; contra epi, n = 14 movies, 0 waves per min; 
P=8.2e ', pairwise t-test) (Supplementary Movie 6), the 
concentration typically used in vivo to examine the role of retinal waves in 
visual-map development. Notably, retinal waves were also abolished 
in superior colliculus and V1 ipsilateral to the injected eye (Fig. 6f) 
(n = 14 movies, 0 waves per min; P= 8.2e **), suggesting that, at 
1,000 µM, epibatidine could mediate some of its effects systemically. 
Consistent with a concentration-dependent systemic effect, monocular 
injection of 0.01 µM or 0.2 µM epibatidine significantly reduced 
wave frequency in the contralateral (0.01 µM, 0.36 ± 0.12 waves per 
min, n = 9 movies, P = 0.00012; 0.2 µM, 0.45 ± 0.10 waves per min, 
n = 12 movies, P = 0.00028, pairwise t-test) but not the ipsilateral 
hemisphere (0.01 µM, 0.60 ± 0.18 waves per min, n = 9 movies, 
P = 0.06280; 0.2 µM, 0.82 ± 0.08 waves per min, n = 12 movies, 
P = 0.89135). These results are consistent with long-standing evidence 
based on in vitro recordings that cholinergic neurotransmission has 
an essential role in generating retinal waves, and that in vivo 
manipulations with epibatidine disrupt visual-map development by abolishing 
retinal waves throughout the visual system. 

Figure 5 | Retinal-wave-driven activity in V1 and extrastriate visual areas. 
a, Experimental overview. b, Image of transcranial cortical GCaMP3 expression 
and OGB1-AM bulk loading in SC of a P6 Emx1-Cre:Ai38 mouse. c, Montage 
(ΔF/F) of a retinal wave recorded in SC and V1 simultaneously from same 
recording as in b. Arrows, direction and distance of wave travel. Arrowheads, 
activations in V2. d, Topographic maps colourized by SC retinal-wave-front 
position reveal putative boundaries of V1 and secondary visual areas (areas 
18a and 18b) from the same animal as in c. e, Cumulative distributions of 
calcium-event frequencies before (control) and after ablation (enucleation) of 
contralateral (Contra-enuc) or ipsilateral (Ipsi-enuc) eye. SC, n = 2,862 ROIs; 
V1, n = 3,490 ROIs; extrastriate, n = 7,717 ROIs. f, Peri-event time histograms of 
calcium-event latencies relative to SC-wave onset. SC, n = 52,284 events; V1, 
n = 38,244 events; extrastriate, n = 49,363 events. g, Maximum ΔF/F intensity 
projections from 10-min recordings before and after contralateral eye 
enucleation (from the same recording as in d). 

Figure 6 | Retinal waves depend on cholinergic 
neurotransmission. a, Experimental overview. 
b, Image of transcranial cortical GCaMP3 
expression and RGC-axon GCaMP3 expression in 
SC in a P4 Rx-Cre:Ai38 mouse. c, Topographic 
maps colourized by SC retinal-wave-front position 
for a single retinal wave simultaneously 
propagating in SC and V1. d, Montage (ΔF/F) of a 
single retinal wave from same recording as in 
c. e, Montage (ΔF/F) showing typical activity 
patterns in visual cortex after contralateral 


Discussion 


There is substantial in vitro evidence for spontaneous waves of 
neuronal activity in the prenatal or postnatal retina of a number of 
vertebrate species, including rodent, ferret and monkey. Slow 
activity transients or bursting activity measured with 
electroencephalography (EEG) or extracellular electrodes in the dLGN and visual 
cortex of rat or ferret in vivo before eye opening are modulated by 
retinal input and therefore believed to be driven by retinal waves. 
These slow activity transients are homologous to the slow activity 
EEG transients recorded in preterm human fetal occipital cortex. 
As these self-organized patterns of spontaneous activity are present 
before the onset of patterned vision, it has long been thought that retinal 
waves may help organize aspects of visual circuit function that are 
established before visual experience, such as maps for orientation, 
direction and ocular dominance. 

Our recordings in mouse pups aged P3-P9 indicate that spontaneous 
retinal waves are present for at least a week of development in vivo and 
exhibit a pattern of activity appropriate for communicating retinal 
organization to circuits throughout the visual system. Imaging of 
RGC afferents or superior colliculus neurons together with simul- 
taneous imaging in V1 shows that retinal waves generate matched 
activity patterns in the midbrain and cortex, supporting the hypo- 
thesis that emergent retinal activity mediates the development of 
linked visual circuitry within and across multiple neocortical and 
subcortical brain regions. 

Retinal waves in vivo exhibited a number of properties that could 
not have been predicted based on previous in vitro recordings, includ- 
ing a preferential site of generation in the ventral-temporal retina, a 
biased direction of wave travel towards the dorsal-nasal retina, and a 
differential role in driving primary versus secondary visual cortical 
activities (see also Supplementary Discussion). Unexpectedly, we also 
found that a subset of retinal waves exhibited coordinated patterns 
across both hemispheres. This suggests that bilateral retinal waves 
could help establish inter-hemispheric connections, a role not previously 
ascribed to waves. Moreover, the remarkable mirror-symmetric pattern 
of ocular-dominance columns between hemispheres observed in some 
monkeys, which is difficult to explain based on the presumed random 
pattern of waves between the eyes, may be understood better in 
the context of the clearly non-random nature of waves across the two 
hemispheres found in vivo. 

A causal link between retinal waves and visual-map development 
has been difficult to establish definitively, partly owing to the lack of 
evidence for wave generation and propagation in vivo. Our experi- 
ments show that pharmacological manipulations that were previously 
shown to profoundly disrupt visual-map development also 
block wave generation in vivo, strengthening the causal link between 
wave activity and map development. 

Given the remarkable fidelity of retinal waves during the period 
before eye opening in mice in vivo that we report here, together with 
previous work showing that spontaneous waves within macaque 
retina are present in vitro before birth, it seems likely that the visual 
system experiences patterned activation by retinal waves for a sub- 
stantial gestational period during human fetal development that may 
be crucial for shaping the functional maturation of neural circuits 
before the onset of sensory experience. 


METHODS SUMMARY 


Anaesthetized C57BL/6 mice between P1 and P3 were injected with calcium 
green-1 dextran (CaGr-Dx) into each retina and returned to the dam for at least 
2 days before preparation for in vivo imaging of RGC axons. On the day of 
calcium imaging, a craniotomy was prepared over the superior colliculus under 
anaesthesia, and a cover-glass was placed over the superior colliculus in warm 
agarose-buffered saline, pH 7.4. 

For postsynaptic calcium imaging, craniotomies were prepared in anaesthetized 
mice aged P3 to P9 over the superior colliculus and visual cortex. The calcium- 
sensitive dye oregon green BAPTA-1-AM (OGB1-AM) was injected into the 
superior colliculus and visual cortex and then a cover-glass was placed over the 
craniotomies. In vivo calcium imaging was carried out in head-fixed, 
un-anaesthetized mouse pups using two-photon excitation through a ×20, 1.0-NA 
(numerical aperture) objective lens or using wide-field epifluorescence imaging 
with low-magnification objectives (×2–5) and a charge-coupled device (CCD) camera when 
the animals awoke during or after a 1-h recovery period from general anaesthesia. 
Calcium transients and waves were detected and analysed using custom routines 
implemented in MATLAB and R. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 

Animals. Animal care and use was in compliance with the Yale Institutional Animal 
Care and Use Committee (IACUC) and the US Department of Health and 
Human Services. Neonatal wild-type mice (C57BL/6J) and Ai38 floxed GCaMP3 
reporter mice (JAX no. 014538) crossed with Emx1-Cre (JAX no. 005628) or 
Rx-Cre mice aged 1–9 days after birth (postnatal day 1 (P1)–P9) were used. 
Anterograde calcium-indicator labelling. For presynaptic calcium imaging 
experiments, 20% w/v calcium green-1 dextran (CaGr-Dx; 3000 MW, 
Invitrogen, C6765) in saline was prepared and injected into the retinas of P1 to P3 
mice that were deeply anaesthetized on ice following protocols published 
previously. After injection of approximately 0.5 µl per eye, five square-wave 100-ms 
pulses of 25 V with 900-ms intervals were applied directly across each eye in both 
directions using methods similar to those published previously for DNA plasmid 
delivery. After electroporating each eye alone, whole-head electroporation was 
applied over both eyes with five 75-V, 100-ms square-wave pulses and 900-ms 
intervals in each direction. The animal was then given topical anaesthetic over 
each eye and allowed to wake and recover on a heating pad before being returned 
to the dam. This technique results in a labelled population of RGCs distributed 
across the majority of the retina and noticeable anterograde labelling of RGC 
axon-terminal arbors in the superior colliculus within 2 days of injection (Fig. 1a). 
Surgical procedure for in vivo imaging. Mice aged P3 to P9 were deeply anaes- 
thetized with isoflurane (2.5%) in oxygen and then placed on a heating pad set to 
36.5 °C using a homoeothermic temperature monitor (NPI TC-20, ALA Scientific). 
Local anaesthesia was produced by subcutaneous injection (0.05 ml) of 1% xylocaine 
(10 mg ml⁻¹ lidocaine per 0.01 mg ml⁻¹ epinephrine, AstraZeneca) under the 
scalp. After removal of the scalp, steel head posts were fixed to the anterior and 
posterior portions of the exposed skull using cyanoacrylate glue. Isoflurane 
anaesthesia was adjusted between 0.5% and 1.0% as necessary to maintain a stable respiratory 
rate. An approximately 2-mm oval craniotomy was created by gentle etching into 
interparietal skull and removing the resulting bone flap above the superior collicu- 
lus, just posterior to lambda using the tip of an 18G syringe needle. After achieving 
haemostasis, the dura was carefully removed using forceps and microdissection 
scissors. The craniotomy was filled with warm (37°C) low-temperature melt 
agarose (A9414, Sigma, 1.5% in sterile buffered saline, 150 mM NaCl, 2.5mM 
KCl, 10 mM HEPES, pH 7.4). 

Cellular bulk loading with a calcium-sensitive dye. Loading neurons in vivo 
with the calcium indicator oregon green BAPTA-1-AM (OGB1-AM; Invitrogen) 
was carried out using standard procedures similar to those described 
previously. Dye was prepared by dissolving 50 µg of OGB1-AM in 4 µl of 20% 
pluronic acid in DMSO (Invitrogen), 35 µl of sterile buffered saline (150 mM NaCl, 
2.5 mM KCl, 10 mM HEPES, pH 7.4) and 1 µl of 10 mM Alexa 594 hydrazide 
(Invitrogen) and sonicating for 20 min. The solution was then filtered through a 
0.45-µm microcentrifuge filter (Millipore). 

Pulled glass micropipettes (1B150F-4; World Precision Instruments) were 
loaded with OGB1-AM dye solution and inserted into an electrode holder 
connected to a Picospritzer III (General Valve Corp) set to an output pressure of 
10 psi. Bulk labelling with OGB1-AM was achieved at 2 to 4 injection sites per 
hemisphere by lowering the pipette to a depth of 100–300 µm below the pial 
surface of the superior colliculus or V1 with a motorized micromanipulator 
(MP-225, Sutter) and delivering 60 brief (20-ms) pressure pulses over the course 
of 2 min. A circular, 5-mm diameter no. 1.5 coverglass (Warner Instruments) was 
placed over the craniotomy in fresh agarose and stabilized with Kwik-Sil (World 
Precision Instruments). 
In vivo calcium imaging. A 1-h recovery period in the dark under continuously 
delivered medical oxygen with isoflurane at 0% was allowed after craniotomy 
surgery and dye injection was completed. This recovery period was the typical 
minimum time required for spontaneous waves of activity to develop in the visual 
system after the cessation of deep anaesthesia. Spontaneous waves occurred only in 
un-anaesthetized mouse pups (26 out of 30 un-anaesthetized mice at 0% isoflurane; 
0 out of 11 anaesthetized mice at ≥0.25% isoflurane, P = 6.099e °°) (Supplementary 
Fig. 4). During the recordings, the head was fixed under the microscope 
objective by steel posts attached to the skull, and the body was loosely surrounded 
by cotton gauze, such that the animals could move freely on the heating pad. 
During the experiment, mouse pups spent the majority of time in a quiet, resting 
state interrupted by brief motor twitches of limbs or tail. 

A two-photon microscope system (Ultima IV, Prairie Technologies) was used 
to image the calcium dyes. A Ti:Sapphire laser (Mai Tai, Spectra-Physics) was 
tuned to 800 nm for excitation of OGB1-AM and CaGr-Dx. Total power delivered 
to the brain was <50 mW. Imaging was carried out using a ×20, 1.0-NA 
saline-immersed objective (no. 421452-9900, Zeiss). Image frames corresponding to a 
field of view of 550 × 550 µm, 225 × 225 µm or 138 × 138 µm were acquired at a 
rate of 1.2 Hz, 7.5 Hz or 22.3 Hz, respectively. Each recording consisted of a single, 
continuously acquired movie during a period of 10 min. 


For wide-field calcium imaging, a CCD camera (Pixelfly, The COOKE 
Corporation) coupled to an Olympus BX51 and a ×5, 0.25-NA objective 
(no. 440125, Zeiss) or ×2.5, 0.075-NA objective (no. 440310-9903, Zeiss) was 
used to image calcium responses in both hemispheres of the superior colliculus 
simultaneously or a collicular hemisphere and the ipsilateral visual cortex simul- 
taneously. Epifluorescent illumination was provided by a Hg’ light source 
(X-Cite Series 120, EXFO) through a neutral-density filter and a filter-cube set 
(U-MGFPHQ, Olympus) with the minimum illumination intensity that still gave 
detectable calcium signals using a CCD exposure of 200 ms. 

Ablation of retinal input. To abolish retinal activity, the animal was re-anaes- 
thetized under 2.5% isoflurane and the eyelid surgically opened with 1% xylocaine 
applied to the eye. For enucleation, the eye was surgically removed and 1% 
xylocaine applied to the orbit. For eye injections, 0.5–1 µl of 10 µM tetrodotoxin 
(TTX; Tocris no. 1069) or (±)-epibatidine dihydrochloride hydrate (10 nM, 
200 nM or 1 mM; Sigma no. E1145) dissolved in saline was pressure injected 
using a glass micropipette inserted into the vitreous. After completion of the 
enucleation or eye injection, the animal was returned to 0% isoflurane with total 
procedure time under deep anaesthesia <10 min. 

Calcium-signal detection. Image processing and calcium-signal detection were 
carried out using custom software routines written in MATLAB (Mathworks). 
For low-magnification two-photon imaging movies or wide-field CCD movies, a 
rectangular grid of ROIs (for each ROI; h = 45 µm, w = 45 µm) was masked over 
the average image, F0, of visible calcium indicator fluorescence for each 
hemisphere of a movie. For cell-based analysis in higher-magnification two-photon 
imaging movies, a motion-correction algorithm described previously was first 
applied to the movie to correct for small xy displacements within the focal plane of 
the movie due to cardiopulmonary or twitch-based movements. Unambiguous 
cell contours were semi-automatically identified in the average image, F0, for 
cell-based movies and an ROI mask created inside each cell. The calcium signal for each 
ROI was the average fluorescence intensity inside that ROI in each frame, Ft, 
measured as a function of time (ΔF/F = (Ft − F0)/F0). Calcium transients were 
detected using automatic unbiased detection routines to identify local maxima 
(>2 standard deviations of the derivative of the signal). Image frames containing 
z-artefacts were identified as frames recorded during periods of time when a 
decrement or increment in fluorescence occurred in 100% of the ROI population 
spanning 2 frames or less, and were excluded from calcium-event detection. 
Calcium-event onsets were set as the first frame in the rising phase of the signal. 
Event offsets were set as the half-amplitude decay time for each calcium transient. 
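
The ΔF/F normalization and derivative-threshold detection described above can be illustrated with a minimal Python sketch. It is not the authors' MATLAB routine: the function names are hypothetical, F0 is assumed to be the mean of the trace, and an onset is assumed to be the first frame after the frame-to-frame derivative crosses 2 standard deviations.

```python
from statistics import mean, pstdev

def delta_f_over_f(trace):
    """Return dF/F = (Ft - F0) / F0, taking F0 as the mean of the trace (assumption)."""
    f0 = mean(trace)
    return [(f - f0) / f0 for f in trace]

def detect_onsets(dff, n_sd=2.0):
    """Return frame indices where the derivative first exceeds n_sd standard deviations."""
    deriv = [b - a for a, b in zip(dff, dff[1:])]
    threshold = n_sd * pstdev(deriv)
    onsets = []
    rising = False
    for i, d in enumerate(deriv):
        if d > threshold and not rising:
            onsets.append(i + 1)  # first elevated frame of the rising phase
        rising = d > threshold
    return onsets
```

Offset detection (half-amplitude decay) and z-artefact exclusion are left out of this sketch.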
Wave detection. A wave-form representing the population activity for each 
movie was constructed by smoothing the envelope of the population activity 
histogram for each movie with a 10-order Hanning filter. Local maxima separated 
by greater than 10s and having rising phase onsets surpassing a 5% population 
activity threshold from the local minima were set as wave peak and onset times 
respectively. The time between the wave peak and the next local minima when 
population activity during the falling phase decreased to less than 20% was set as 
the wave offset. Each wave period was then interactively confirmed based on the 
visually detected waves in the raw movie data. Any detected calcium transients 
occurring outside of wave periods were excluded from further analysis. 
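
As a rough illustration of the peak-finding step, the following Python sketch smooths a population-activity trace with a Hann window and keeps local maxima that are more than 10 s apart and rise by more than 5% above the preceding minimum. Several details are assumptions rather than the published routine: a "10-order" filter is taken to mean 11 taps, activity is expressed as a fraction of ROIs per frame, and the function names are hypothetical. Wave offsets and manual confirmation are omitted.

```python
import math

def hann_smooth(signal, order=10):
    """Convolve with a normalised Hann window of order + 1 taps (same-length output)."""
    n = order + 1
    window = [0.5 - 0.5 * math.cos(2 * math.pi * (k + 1) / (n + 1)) for k in range(n)]
    total = sum(window)
    window = [w / total for w in window]
    half = n // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for k, w in enumerate(window):
            j = i + k - half
            if 0 <= j < len(signal):
                acc += w * signal[j]
        out.append(acc)
    return out

def detect_wave_peaks(activity, frame_rate_hz, min_sep_s=10.0, min_rise=0.05):
    """Return frame indices of wave peaks in a population-activity trace."""
    smooth = hann_smooth(activity)
    peaks = []
    last_min = smooth[0]  # lowest value seen since the previous accepted peak
    for i in range(1, len(smooth) - 1):
        last_min = min(last_min, smooth[i])
        is_local_max = smooth[i] > smooth[i - 1] and smooth[i] >= smooth[i + 1]
        if is_local_max:
            far_enough = not peaks or (i - peaks[-1]) / frame_rate_hz > min_sep_s
            if far_enough and smooth[i] - last_min > min_rise:
                peaks.append(i)
                last_min = smooth[i]
    return peaks
```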

To detect wave-fronts, an array of spike times (calcium-event onsets) for each 
wave was constructed by creating an M × N × P array in which M × N equals the x, 
y dimensions of the movie image and P equals the number of frames within a 
wave. A binary mask for each ROI exhibiting a calcium spike during a wave was 
created at the corresponding time within the array, for the wave to represent 
individual spike times. A binary dilation followed by an erosion operation using a 
structuring element with a height and width three pixels larger than the height 
and width of the ROI contours was then convolved over each wave frame in the 
spike array, and any eight-neighbour connected components in each binary 
frame that were less than two ROI contour areas in size were discarded. To 
calculate the wave-front position for each frame within the spike array for the 
wave, a structuring element that was 2.5 times larger in height and width than 
each ROI was used to perform a binary dilation of the frame. The centroid 
location of the binary connected components within the resulting spike frame 
was then taken to be the wave-front location for that frame. 
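
The component filtering and centroid step can be sketched in plain Python as below. This is only an illustration under stated assumptions: the binary dilation and erosion are omitted, the mask is a list of rows of 0/1 values, `min_area` stands in for "two ROI contour areas" in pixels, and both function names are hypothetical.

```python
def components_8(mask):
    """Return the 8-neighbour connected components of a binary mask as lists of (row, col)."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    comps = []
    for r in range(rows):
        for c in range(cols):
            if mask[r][c] and not seen[r][c]:
                stack, comp = [(r, c)], []
                seen[r][c] = True
                while stack:
                    y, x = stack.pop()
                    comp.append((y, x))
                    for dy in (-1, 0, 1):
                        for dx in (-1, 0, 1):
                            ny, nx = y + dy, x + dx
                            inside = 0 <= ny < rows and 0 <= nx < cols
                            if inside and mask[ny][nx] and not seen[ny][nx]:
                                seen[ny][nx] = True
                                stack.append((ny, nx))
                comps.append(comp)
    return comps

def wavefront_centroid(mask, min_area):
    """Centroid (row, col) of all pixels in components with at least min_area pixels."""
    kept = [p for comp in components_8(mask) if len(comp) >= min_area for p in comp]
    if not kept:
        return None  # no component survived the size filter
    n = len(kept)
    return (sum(y for y, _ in kept) / n, sum(x for _, x in kept) / n)
```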

For each wave, a single merged wave frame was created by making a maximum 
projection along the dimension P of the binary wave-spike array. The wave area 
was the sum of non-zero areas within the merged wave frame and the major axis 
length of the largest eight-neighbour connected component within the merged 
wave frame was taken to be the distance for that wave. The active fraction for each 
wave of each hemisphere was the number of ROIs participating in a wave divided 
by the total number of ROIs in that recording. The active fraction for each 
hemisphere was the total number of ROIs active at least once during a recording 
divided by the total number of ROIs. Wave speeds were determined by finding the 
set of distances for each wave frame’s centroid relative to the wave-onset centroid 
and calculating the mean of this set. Wave directions were determined by calculat- 
ing the vector sum of the iterative directions between wave-front centroids in 
sequential frames for each wave and were interactively corrected if the automatically 
calculated wave direction was off by more than π/4 radians from the visually 
discerned wave angle. Measured wave directions in the left hemisphere were 
mirrored so that the direction values could be compared directly between hemi- 
spheres. Wave frequencies for each recording were calculated as the number of 
waves divided by the length of time for each recording. Inter-wave intervals for 
each recording were calculated as the set of time intervals between the onsets of 
sequential waves in each recording. 
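
A minimal Python sketch of these per-wave summaries follows. It assumes that each frame-to-frame step contributes a unit vector to the sum, and that mirroring a left-hemisphere direction means reflecting it about the vertical axis; neither detail is specified in the text, and the function names are hypothetical.

```python
import math

def wave_direction(centroids):
    """Angle (radians) of the vector sum of unit steps between successive centroids (x, y)."""
    sx = sy = 0.0
    for (x0, y0), (x1, y1) in zip(centroids, centroids[1:]):
        step = math.hypot(x1 - x0, y1 - y0)
        if step > 0:
            sx += (x1 - x0) / step
            sy += (y1 - y0) / step
    return math.atan2(sy, sx)

def mirror_direction(angle):
    """Reflect an angle about the vertical axis (x becomes -x)."""
    return math.atan2(math.sin(angle), -math.cos(angle))

def inter_wave_intervals(onsets_s):
    """Time intervals between the onsets of sequential waves."""
    return [b - a for a, b in zip(onsets_s, onsets_s[1:])]

def wave_frequency(n_waves, duration_s):
    """Number of waves divided by the recording length, in Hz."""
    return n_waves / duration_s
```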

Data analysis. Data sets were analysed using custom routines written in 
MATLAB (Mathworks) and in R (The R Project for Statistical Computing; 
http://www.r-project.org). To quantify temporal correlations between cells 
within a recording, cross-correlation values within a calcium-event onset window 
of ±100 ms or ±1 frame for each pair of cells in a network were calculated. Interval 
reshuffling was then carried out as described previously. The significance of 
cross-correlation values for real data sets was determined through comparison 
with distributions calculated from the randomized data sets using a threshold 
significance level of P< 0.05. The distribution of pairwise cell correlations as 
a function of distance was normalized to the distribution of all pairwise cell 
distances to determine the relative pairwise cell-correlation probability density 
(Supplementary Fig. 5). Distribution means were compared using two-sample 
Student’s t-tests or using analysis of variance (ANOVA) followed by pairwise 
t-tests with Holm correction when analysing the effects of multiple grouping 
factors (P<0.05 set as significance). Values are reported as means with the 
standard error of the mean or medians with the median absolute deviation. 
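
For readers unfamiliar with the two summary conventions named above, here is a short Python sketch of the Holm step-down adjustment and of the median absolute deviation. The analysis itself was done in MATLAB and R; the function names here are hypothetical and the inputs in the usage note are made-up values, not data from the study.

```python
from statistics import median

def holm_adjust(p_values):
    """Holm step-down adjusted P values, returned in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        candidate = min(1.0, (m - rank) * p_values[idx])
        running_max = max(running_max, candidate)  # enforce monotonicity
        adjusted[idx] = running_max
    return adjusted

def median_abs_dev(values):
    """Median of absolute deviations from the median (unscaled)."""
    centre = median(values)
    return median(abs(v - centre) for v in values)
```

For example, `holm_adjust([0.01, 0.04, 0.03])` multiplies the smallest value by 3 and the next by 2, and then carries the running maximum forward to the largest.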

Peri-event time histograms were calculated using wave-onset times as the 
stimulus trigger for every ROI of a movie, using an interval window of [−1,000, 
15,000 ms] around each wave time to find the set of detected calcium-transient 
onsets. The normalized response frequency for each ROI was calculated as the total 
number of detected calcium onsets divided by the number of stimulus triggers for 
each recording. For each ROI of each hemisphere, the normalized 
anterior–posterior and medial–lateral distances were calculated as the ROI centroid x and y 
distances from the most anterior–medial location in the dye-labelled craniotomy 
image normalized to the most posterior and lateral locations in the dye-labelled 
craniotomy image. 
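
The peri-event time histogram for one ROI can be sketched in Python as follows. The [−1,000, 15,000 ms] window comes from the text; the 1,000-ms bin width and the function names are assumptions made for illustration.

```python
def peri_event_histogram(event_times_ms, trigger_times_ms,
                         window_ms=(-1000, 15000), bin_ms=1000):
    """Count calcium-event onsets in bins of lag relative to each wave-onset trigger."""
    lo, hi = window_ms
    counts = [0] * ((hi - lo) // bin_ms)
    for trig in trigger_times_ms:
        for ev in event_times_ms:
            lag = ev - trig
            if lo <= lag < hi:
                counts[int((lag - lo) // bin_ms)] += 1
    return counts

def normalized_response_frequency(counts, n_triggers):
    """Total detected onsets inside the window divided by the number of triggers."""
    return sum(counts) / n_triggers
```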

For comparison of wave directions in the superior colliculus and visual cortex, 
wave directions in visual cortex were matched to the superior colliculus retino- 
topic map by mirroring and rotating the measured visual cortex wave angles as 
shown in Supplementary Fig. 8. 

To quantify the temporal and spatial correlations of retinal waves occurring at 
the same time in both hemispheres, the distribution of inter-wave onset intervals 
in Supplementary Fig. 1 was fitted to a gamma distribution and the resulting 
shape (2.5177) and rate (0.0387) parameters from the observed distribution were 
used to generate sequences of random wave intervals to which the observed wave- 
interval sequences were compared. For each recording, the observed sequences of 
wave-interval onset times for the two hemispheres were smoothed with a gaussian 
function (sigma = 3 s). A cross-correlation value was then calculated between 
these two smoothed wave-interval ‘spike’ trains and compared with a distribution 
of cross-correlation values calculated from 1,000 random wave-onset simulations 
that consisted of one sequence of observed wave onsets and one sequence of 
random wave intervals. Overlapping wave onsets observed in a recording were 
determined to be significantly temporally correlated if the observed cross-correlation 
value exceeded 95% of the simulated temporal cross-correlation values. 
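
The simulation test for temporal correlation can be sketched in Python as below. The gamma shape and rate, the 3-s gaussian and the 1,000 simulations are taken from the text; everything else is an assumption for illustration: intervals are treated as seconds, trains are sampled once per second, the cross-correlation value is taken at zero lag, and the function names are hypothetical.

```python
import math
import random

SHAPE, RATE = 2.5177, 0.0387  # gamma parameters reported in the text

def random_onsets(duration_s, rng):
    """Simulated wave onsets with gamma-distributed inter-wave intervals."""
    t, onsets = 0.0, []
    while True:
        t += rng.gammavariate(SHAPE, 1.0 / RATE)  # gammavariate takes shape and scale
        if t >= duration_s:
            return onsets
        onsets.append(t)

def smoothed_train(onsets, duration_s, sigma_s=3.0):
    """Sum of gaussians centred on each onset, sampled once per second."""
    return [sum(math.exp(-0.5 * ((t - o) / sigma_s) ** 2) for o in onsets)
            for t in range(int(duration_s))]

def zero_lag_correlation(a, b):
    """Unnormalised cross-correlation of two trains at zero lag."""
    return sum(x * y for x, y in zip(a, b))

def is_temporally_correlated(onsets_a, onsets_b, duration_s, n_sim=1000, seed=0):
    """True if the observed value exceeds 95% of the simulated values."""
    rng = random.Random(seed)
    train_a = smoothed_train(onsets_a, duration_s)
    observed = zero_lag_correlation(train_a, smoothed_train(onsets_b, duration_s))
    n_below = 0
    for _ in range(n_sim):
        sim = smoothed_train(random_onsets(duration_s, rng), duration_s)
        if observed > zero_lag_correlation(train_a, sim):
            n_below += 1
    return n_below / n_sim > 0.95
```

With these parameters the mean simulated interval is shape/rate, about 65 s, which is in line with the roughly one wave per minute reported for control recordings.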

For determining bilateral spatial correlations, a database of image masks for 
×2.5 CCD recordings was created for every frame between the onset and offset of 
each wave by segmenting the raw ΔF/F image using Otsu’s method. For each wave 
pair with overlapping wave onsets, the wave image mask for one hemisphere 
was flipped and a two-dimensional cross-correlation was computed between 
the two hemispheres for each frame of a bilateral wave. The maximum value in 
the two-dimensional correlation image was used to calculate the estimated x, y 
displacements between the wave masks for the two hemispheres. A set of 
Euclidean distances was calculated from the x, y displacement to the location 
of the two-dimensional autocorrelation value for one of the hemispheres for each 
wave frame (centre of the wave-mask image) and then the mean Euclidean 
distance was taken as an observed spatial similarity metric (SSM) for that wave 
pair. A second SSM was also calculated for each wave pair by calculating the 
Mahalanobis distance from a set of pattern vectors for the wave masks in each 
hemisphere, with the input pattern vector being an M × N array, in which for 
each frame, M, a set of N pattern metrics was calculated that included the x, y 
displacement values from the two-dimensional correlation, a set of wave-mask 
image texture descriptors (functions ‘graycoprops’, ‘statxture’ and ‘invmoments’ 
in MATLAB Image Processing Toolbox), and a set of regional measurements 
from the largest component in the binary wave mask (‘area’, ‘centroid’, 
‘eccentricity’, ‘equivdiameter’, ‘extent’, ‘majoraxislength’, ‘minoraxislength’, ‘orientation’ 
and ‘solidity’ from function ‘regionprops’, MATLAB Image Processing Toolbox). 
For each recording, if the simulated temporal correlation value exceeded 
the observed temporal correlation value, then for each overlapping wave pair a 
random wave from the wave-mask database was aligned to the anterior-medial 
point of the superior colliculus for one of the observed wave masks and then the 
random SSMs were calculated. An observed wave pair was spatially correlated if 
<1% of the random wave pairs had SSMs of less than or equal to the observed 
SSMs for that wave pair. 
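
The segmentation step relies on Otsu's method, which picks the threshold that maximizes the between-class variance of the intensity histogram. The Python sketch below is a generic version of that technique and not the authors' MATLAB code; the 256-bin histogram, the flat list of pixel values and the function name are assumptions.

```python
def otsu_threshold(values, n_bins=256):
    """Return the threshold that maximises between-class variance of a value histogram."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return lo  # constant image: no meaningful split
    width = (hi - lo) / n_bins
    hist = [0] * n_bins
    for v in values:
        hist[min(int((v - lo) / width), n_bins - 1)] += 1
    total = len(values)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_bin, best_var = 0, -1.0
    w0, sum0 = 0, 0.0
    for i, h in enumerate(hist):
        w0 += h            # pixels at or below bin i
        sum0 += i * h
        w1 = total - w0    # pixels above bin i
        if w0 == 0 or w1 == 0:
            continue
        between = w0 * w1 * (sum0 / w0 - (sum_all - sum0) / w1) ** 2
        if between > best_var:
            best_var, best_bin = between, i
    return lo + (best_bin + 1) * width  # upper edge of the last background bin
```

A wave mask would then be the set of pixels whose ΔF/F value exceeds the returned threshold.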
A neural circuit for spatial summation in visual cortex 


Hillel Adesnik, William Bruns, Hiroki Taniguchi, Z. Josh Huang & Massimo Scanziani 


The response of cortical neurons to a sensory stimulus is modulated by the context. In the visual cortex, for example, 
stimulation of a pyramidal cell’s receptive-field surround can attenuate the cell’s response to a stimulus in the centre of 
its receptive field, a phenomenon called surround suppression. Whether cortical circuits contribute to surround 
suppression or whether the phenomenon is entirely relayed from earlier stages of visual processing is debated. Here 
we show that, in contrast to pyramidal cells, the response of somatostatin-expressing inhibitory neurons (SOMs) in the 
superficial layers of the mouse visual cortex increases with stimulation of the receptive-field surround. This difference 
results from the preferential excitation of SOMs by horizontal cortical axons. By perturbing the activity of SOMs, we 
show that these neurons contribute to pyramidal cells’ surround suppression. These results establish a cortical circuit for 
surround suppression and attribute a particular function to a genetically defined type of inhibitory neuron. 


Visual stimuli located outside of the classical receptive field of a neu- 
ron in visual cortex are unable to elicit spiking, but they may modulate 
the neuron’s response to stimuli located in its receptive field. 
Surround suppression, a basic operation in visual processing, is a 
classical example of this type of modulation and can be easily 
observed when monitoring the firing of a neuron to a stimulus of 
increasing size centred on its receptive field (that is, the size tuning 
of the neuron): the neuron’s initial increase in firing is followed by a 
decrease in firing as the stimulus becomes progressively larger. This 
form of suppression has been suggested to contribute to a number of 
perceptual effects like pop-out, curvature detection and orientation 
discrimination. Importantly, surround suppression is not only 
observed in the cortex but is already present at earlier stages along 
the visual hierarchy, namely in the retina and the thalamus. 
Thus, although it is likely that at least part of the suppressive surround 
observed in the cortex is relayed from earlier stages of visual 
processing, some experimental observations and theoretical models suggest 
that the cortex is itself capable of contributing to surround 
suppression. Here we reveal the identity and describe the mechanism of a 
cortical circuit that directly contributes to surround suppression in the 
superficial layers of the primary visual cortex. 


Neuron-type-specific spatial summation 


We determined the tuning to the size of a visual stimulus for neurons in
the superficial layers of the primary visual cortex (V1; depth ~100-350 µm,
corresponding approximately to layers 2 and 3 (layer 2/3)) of
mice. Experiments were performed in awake, running animals, as size
tuning was affected by anaesthesia (Supplementary Fig. 1). Mice were
head fixed but otherwise unrestrained and free to run on a passive
circular treadmill. To reduce variability due to differences in behavioural
state, all data presented here were recorded during running events (see
Supplementary Methods). Visual stimuli were composed of circular
patches of drifting gratings at maximal contrast, presented in 6 or 7
different sizes (from 8 to 97 degrees in diameter, Fig. 1a). The size-tuning
curve of isolated units (n = 53; Supplementary Fig. 2), that is, the
neuronal firing rate as a function of stimulus size, peaked at 22 ± 2 degrees
(preferred size; ± s.e.m.) and progressively decreased with larger stimuli
(Fig. 1a, d), revealing marked surround suppression. The mean firing
rates (± s.e.m.) were 0.47 ± 0.11 Hz at baseline (FR_BL);
3.0 ± 1.0 Hz in response to the smallest stimulus (FR_SS); 3.1 ± 0.3 Hz
in response to the preferred stimulus (FR_PS); and 1.0 ± 0.2 Hz in
response to the largest stimulus (FR_LS). The stimulus modulation index
(SMI, computed as (FR_PS − FR_BL)/FR_PS) was 0.87 ± 0.03. The
suppression index, that is, the difference between the peak response and the
response to the largest stimulus, divided by the baseline-subtracted peak
response ((FR_PS − FR_LS)/(FR_PS − FR_BL); Fig. 1a), averaged 0.9 ± 0.1
(n = 53; the suppression index was statistically significant in 33 out of
53 units (permutation test); Fig. 1d), indicating substantial suppression
in response to large stimuli. Infrequent eye movements occurring during
running had little effect on the size-tuning curve (Supplementary Fig. 3).
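
The stimulus modulation index and the suppression index defined above can
be computed directly from a single size-tuning curve. The Python sketch
below is illustrative only: the function names and the example firing rates
are hypothetical and are not taken from the study, and it assumes that the
baseline rate is measured separately from the stimulus-evoked rates.

```python
def stimulus_modulation_index(fr_baseline, fr_preferred):
    """SMI = (FR_PS - FR_BL) / FR_PS."""
    return (fr_preferred - fr_baseline) / fr_preferred


def suppression_index(fr_baseline, fr_preferred, fr_largest):
    """SI = (FR_PS - FR_LS) / (FR_PS - FR_BL)."""
    return (fr_preferred - fr_largest) / (fr_preferred - fr_baseline)


def tuning_indices(sizes_deg, rates_hz, fr_baseline):
    """Return preferred size, SMI and SI for one size-tuning curve."""
    # Preferred stimulus: the size that evokes the highest firing rate.
    peak = max(range(len(rates_hz)), key=lambda i: rates_hz[i])
    # Largest stimulus: the biggest diameter that was presented.
    largest = max(range(len(sizes_deg)), key=lambda i: sizes_deg[i])
    return {
        "preferred_size_deg": sizes_deg[peak],
        "smi": stimulus_modulation_index(fr_baseline, rates_hz[peak]),
        "si": suppression_index(fr_baseline, rates_hz[peak], rates_hz[largest]),
    }


# Hypothetical unit (made-up numbers, for illustration only).
example = tuning_indices(
    sizes_deg=[8, 13, 22, 36, 60, 97],
    rates_hz=[2.0, 3.0, 4.0, 3.0, 2.0, 1.0],
    fr_baseline=0.5,
)
print(example)
```

Because the population values reported in the text are averages of per-unit
indices, they need not equal the index obtained by inserting the mean firing
rates into the same formulas.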

If cortical circuits contribute to surround suppression, they may
involve the suppressive action of cortical inhibitory neurons. An
inhibitory neuron lacking surround suppression and whose response
increases with stimulus size would be a good candidate. In cat visual
cortex, for example, fast-spiking inhibitory neurons respond with
higher firing rates to large stimuli than to small visual stimuli. We
carried out targeted loose-patch recordings from inhibitory neurons
in layer 2/3 of V1 in awake, running mice, using two-photon laser
scanning microscopy (Fig. 1b, c). Parvalbumin-expressing neurons
(PVs), a large class of cortical inhibitory neurons, were visualized in
layer 2/3 by crossing a PV-Cre mouse line with a tdTomato-reporter
line. The size-tuning curve of PVs peaked at 57 ± 8 degrees (n = 11)
and showed marked surround suppression with larger stimuli
(suppression index, 0.46 ± 0.12, n = 11; suppression index statistically
significant in 6 out of 11 cells, permutation test (Fig. 1b, e); FR_BL,
9 ± 2 Hz; FR_SS, 27 ± 7 Hz; FR_PS, 45 ± 11 Hz; FR_LS, 26 ± 8 Hz; SMI,
0.74 ± 0.07; recorded PVs showed their characteristic ‘thin’ spike
shapes (Supplementary Fig. 2), confirming the accuracy of our
targeting strategy). In contrast to PVs, SOMs, another large class of
cortical inhibitory neurons (visualized by crossing a SOM-Cre line with
a tdTomato-reporter line), completely lacked surround suppression
(suppression index, 0.09 ± 0.06; n = 8; suppression index statistically
significant in 0 out of 8 cells; permutation test; significantly different
from PVs; P < 0.03, rank sum test; Fig. 1f). The size-tuning curve of
these neurons showed a monotonic increase or saturation in firing rate
with stimulus size (Fig. 1c, f). Although the smallest stimuli were
relatively inefficient in driving SOMs (FR_BL, 7 ± 2 Hz; FR_SS,
5 ± 2 Hz), SOMs responded robustly to large stimuli (FR_PS, 26 ± 2 Hz;
preferred size, 86 ± 3 degrees, different from PV, P < 0.015, rank sum
test; SMI, 0.75 ± 0.05). These data demonstrate that in V1, spatial
summation can be very different between genetically distinct types
of neurons. Furthermore, these data suggest that SOMs are potential
candidates in the generation of cortical surround suppression.

Figure 1 | Differences in spatial summation between three types of neurons
in the visual cortex. a, Top, schematic of the experimental setup. Centre,
response of example unit to visual stimuli of three different sizes (a raster plot
and a peristimulus time histogram (PSTH) are shown). Shaded area behind the
raster plot represents the period of stimulus presentation. Bottom, average
size-tuning curve of 53 peak-aligned and normalized single units (6 animals, 11
recording sessions). Grey triangles with dashed lines, baseline firing rate. Inset,
average of the normalized but not peak-aligned 53 size-tuning curves.
SI, suppression index; SMI, stimulus modulation index. b, Top left, schematic
of the experimental setup. Top right, tdTomato-expressing PV (red) with
attached Alexa Fluor 488-filled recording pipette (green). Centre, response of
PV to visual stimuli of three different sizes (a raster plot and a PSTH are
shown). Bottom, average size-tuning curve (n = 11 peak-aligned and
normalized size-tuning curves; 3 animals). Inset, average of the normalized but
not peak-aligned 11 size-tuning curves. Obj., microscope objective. c, As in
b but for SOMs (n = 7; 4 animals). d-f, Distribution of SIs (left panels) and of
preferred stimulus sizes (right panels) for single units (d), PVs (e) and SOMs
(f). The SOM and PV data are superimposed onto the single-unit data (grey,
from e) for comparisons. All error bars are s.e.m.


Excitation of SOMs by horizontal axons

In response to the observations above, we examined two fundamental
issues: first, what cortical circuits enable SOMs, in contrast to other
cortical neurons, to be facilitated rather than suppressed by large
stimuli; and second, whether SOMs contribute to size tuning in V1.
The two predominant excitatory inputs to layer 2/3 are vertically
ascending axons from layer 4 and horizontally projecting axons from
layer 2/3. To examine whether SOMs are equally excited by these two
inputs, we recorded from layer 2/3 pyramidal cells, SOMs and PVs in
coronal slices of V1 and selectively photo-activated (light ramp of 2 s
duration; 480 nm light) layer 4 excitatory cells that conditionally
expressed channelrhodopsin 2 (ChR2; Supplementary Fig. 4).
Layer 4 photostimulation generated excitatory charges in SOMs that
were only 17 ± 5% (± s.e.m.; n = 8) of those in simultaneously
recorded pyramidal cells (Fig. 2a); in contrast, excitatory charges
generated in PVs were 250 ± 39% (n = 8) of those generated in
simultaneously recorded pyramidal cells (Fig. 2b). These results were
corroborated through ChR2-assisted circuit mapping (Supplementary
Fig. 5). Thus, ascending layer 4 axons provide little excitation to
SOMs. Notably, photostimulation of layer 2/3 pyramidal cells (2 s light
ramp duration) that selectively expressed ChR2 (see Methods)
produced substantial excitation of layer 2/3 SOMs (Fig. 2d): the excitation
that SOMs received was in fact significantly larger than that received by
simultaneously recorded pyramidal cells (241 ± 85%, n = 7, P < 0.05;
Fig. 2e; we selectively recorded from pyramidal cells that did not
express ChR2 in order to avoid contamination with photocurrents).
Furthermore, although photostimulation of horizontal layer 2/3
projections was accompanied by strong disynaptic inhibition in pyramidal
cells, very little inhibition was recorded in SOMs (Fig. 2e; the ratio of
excitation to the sum of excitation and inhibition (E/(E + I)) was
0.11 ± 0.01 (n = 10) in pyramidal cells versus 0.59 ± 0.06 (n = 11)
in SOMs; P < 0.05, Fig. 2f). These results show that although layer
2/3 pyramidal cells and PVs receive substantial excitatory drive from
ascending layer 4 axons, the main source of excitation to layer 2/3 SOMs
is the horizontal axons of layer 2/3.
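
The two quantities compared in this section, the excitatory charge relative
to a simultaneously recorded pyramidal cell and the ratio E/(E + I), are
simple to compute once the charges are known. The following Python sketch
uses hypothetical function names and made-up charge values (in arbitrary
units); it assumes that excitatory and inhibitory charges may carry opposite
signs and therefore works with their magnitudes.

```python
def relative_charge_percent(charge_cell, charge_pyramidal):
    """Excitatory charge in a cell as a percentage of that in the paired PC."""
    return 100.0 * abs(charge_cell) / abs(charge_pyramidal)


def excitation_fraction(excitatory_charge, inhibitory_charge):
    """E / (E + I), using charge magnitudes so the sign convention is irrelevant."""
    e = abs(excitatory_charge)
    i = abs(inhibitory_charge)
    return e / (e + i)


# Hypothetical charges chosen only to illustrate the arithmetic.
pc_fraction = excitation_fraction(excitatory_charge=11.0, inhibitory_charge=-89.0)
som_fraction = excitation_fraction(excitatory_charge=59.0, inhibitory_charge=-41.0)
som_vs_pc = relative_charge_percent(charge_cell=17.0, charge_pyramidal=100.0)
print(pc_fraction, som_fraction, som_vs_pc)
```

A value of E/(E + I) near 0 indicates a response dominated by inhibition,
whereas a value near 1 indicates a response dominated by excitation.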


Figure 2 | SOMs are selectively excited by horizontal cortical projections.
a, Left, schematic of the experimental setup. ChR2 is expressed selectively in
layer 4 (L4) excitatory neurons. Recording electrodes in layer 2/3 (L2/3) target a
SOM and a pyramidal cell (PC). Right, excitatory currents simultaneously
recorded in a SOM and a PC in response to photostimulation of layer 4 with a
ramp of blue light (horizontal blue line). EPSC, excitatory postsynaptic current.
b, Left, schematic of the experimental setup. As in a but whole-cell recording
electrodes in layer 2/3 target a PV and a PC. Right, excitatory currents
simultaneously recorded in a PV and a PC in response to photostimulation of
layer 4 as in a. c, Summary statistics of the excitatory charge (as a fraction of that
simultaneously recorded in the PC) recorded in SOMs (n = 8) and PVs (n = 8);
P < 0.05. d, Left, schematic of the experimental setup. As in a except ChR2 is
expressed selectively in PCs of layer 2/3. Right, excitatory currents (red, top
traces) and inhibitory currents (bottom, blue traces) simultaneously recorded
in a SOM and a PC in response to photostimulation of layer 2/3 with a ramp of
blue light. IPSC, inhibitory postsynaptic current. e, Left, summary statistics of
excitatory charge (as a fraction of that simultaneously recorded in the PC)
recorded in SOMs compared to that recorded in layer 2/3 PCs (n = 7, P < 0.05).
Right, ratio of excitation to inhibition (expressed as E/(E + I)) recorded in SOMs
and PCs (n = 10, P < 0.05). All error bars are s.e.m.


Size-dependent excitation of SOMs


To ascertain whether these horizontal axons are indeed responsible
for the size-dependent recruitment of SOMs, we took advantage of the
retinotopic organization of V1 (ref. 32); we reasoned that because
progressively larger visual stimuli presented in vivo will result in a
progressively larger visually activated area in V1, we could approximate
this expansion of activity by directly photostimulating progressively
larger areas of V1. We carried out loose-patch recordings from SOMs
in coronal slices of V1 expressing ChR2 in layer 2/3 pyramidal cells
(Fig. 3a). The firing rate of SOMs increased as a function of the size of
the light spot (180 to 900 µm), similar to their increase in firing rate
in vivo with increasing visual stimulus size (Fig. 3a, b). Consistent with
the increase in firing rate, the synaptic excitation received by SOMs
increased with increasing light-spot size (Fig. 3a, b). If SOM dendrites
were to span areas similar to the largest light-spot diameter, the
progressive increase in firing rate with spot size might simply result from the
direct photostimulation of synapses on the dendritic arborization of the
recorded SOM. However, this was not the case because even the smallest
light spot used (180 µm diameter), generating only approximately 25%
of the maximal firing rate, covered more than 95% of the entire SOM
dendritic arborization (Fig. 3b, see Supplementary Methods). Thus, the
increase in SOM firing rate with spot size results from the recruitment of
progressively more distant layer 2/3 pyramidal cells (Fig. 3b, see Methods).
Furthermore, cutting horizontal axons with two vertical cuts through
layer 2/3 on each side of the recorded SOMs (320 ± 25 µm (± s.e.m.)
between cuts, centred on the cell, n = 10; note that the distance between
the cuts is larger than the horizontal dendritic extent of SOMs) prevented
the increase in firing rate with stimuli larger than the distance between
the two cuts (Supplementary Fig. 6). Thus, by using horizontal layer 2/3
projections as their main excitatory drive, SOMs are recruited as a
function of the activated V1 area; that is, they sum activity in visual space.

Is the size-dependent recruitment of SOMs a mechanism that could
contribute to the suppression of pyramidal-cell firing to large stimuli?
We recorded from pyramidal cells in coronal slices of V1
conditionally expressing ChR2 in layer 2/3 (again, we selectively recorded
from pyramidal cells that did not express ChR2). The firing rate of
pyramidal cells was set to approximately 10 Hz by direct-current
injection (Fig. 3c). A small light spot centred around the recorded
pyramidal cell reduced the firing rate, and this suppression became
progressively more pronounced as a larger area of layer 2/3 was
activated by increasing the size of the light spot (Fig. 3c). Consistent with
the progressive suppression in firing rate, the inhibition received by
pyramidal cells increased with increasing light-spot size (Fig. 3c, d).
Finally, to establish that the inhibition generated in pyramidal cells
after photo-activation of layer 2/3 (ref. 31) was due to the recruitment
of SOMs and not any other interneuron type, we optogenetically
silenced SOMs (see Methods) while monitoring the inhibition in
pyramidal cells during photo-activation of layer 2/3 (Fig. 3e).
Photostimulation of layer 2/3 to activate pyramidal cells generated
strong firing in SOMs and large inhibitory currents in simultaneously
recorded pyramidal cells, consistent with the results reported above
(Fig. 3e). Notably, concomitant optogenetic silencing of SOMs (100%
reduction of firing; n = 6) strongly reduced the inhibitory currents in
pyramidal cells (80 ± 4% reduction (± s.e.m.); n = 8, P < 0.05, Fig. 3e, f).

Thus, the stimulus-size-dependent recruitment of SOMs generates
strong inhibition in layer 2/3 pyramidal cells and efficiently suppresses
their firing rate (Fig. 3g).


Figure 3 | Suppression of pyramidal cells by SOMs as a function of the
activated layer 2/3 area. a, Left, schematic of the experimental setup. ChR2 is
expressed in a fraction of PCs in layer 2/3. Inset, anatomical reconstruction of a
biocytin-filled layer 2/3 SOM (dendrites, orange; axon, grey; top dotted line,
border with layer 1; bottom dotted line, border with layer 4). Right, action
potentials (black traces, top) recorded in the cell-attached mode in a SOM in
response to light-spot sizes of increasing diameter. Bottom, excitatory currents
(red traces) recorded subsequently in the whole-cell voltage-clamp
configuration in the same SOM neuron in response to the same photo-stimuli.
EPSC, excitatory postsynaptic current. b, Summary graph for the spiking
(black; n = 14) and excitatory charge (red; n = 6) of SOMs in response to light
spots of five different diameters. Orange, summary statistics of the cumulative
fraction of SOM dendritic arbor length within a sampled horizontal interval
centred on the SOM cell body (n = 6). Inset, dendrites of the SOM illustrated in
a but scaled to x axis. c, Left, schematic of the experimental setup. As in a but
recording from PC. Right, spiking of PC recorded in current-clamp mode
(black traces) in response to depolarizing current steps while layer 2/3 is
photo-stimulated with increasingly large blue light spots (top), and inhibitory currents
recorded in a PC to the same light stimuli (bottom). IPSC, inhibitory
postsynaptic current. d, Summary graph of the suppression of firing of PCs
(black, n = 7) and intracellularly recorded inhibitory charge (blue, n = 6) to
light spots of five different diameters. Photo-hyperpolarizing SOMs (blue
arrow; see e) reduces inhibitory charge in PCs. e, Schematic of the experimental
setup. ChR2 is expressed in a fraction of layer 2/3 PCs and halorhodopsin (halo)
is conditionally expressed in SOMs. Recording electrodes target a SOM and a
PC. Full-field blue light activates layer 2/3 PCs, whereas red light suppresses
SOMs. Traces, spikes (black traces, top) recorded in the cell-attached mode in a
SOM and inhibitory currents (blue traces, bottom) simultaneously recorded in
a voltage-clamped PC in response to blue light photo-stimulation (blue bar) of
layer 2/3. Simultaneous illumination with red light (red bar, right panel) to
photo-hyperpolarize SOMs abolishes SOM firing and reduces inhibitory
currents in the PC (see also blue arrow in d). f, Summary graph for
halorhodopsin-mediated reduction of SOM firing (n = 6) and concomitant
reduction in inhibitory charge in layer 2/3 PCs (n = 8).


SOMs contribute to surround suppression

These data provide a plausible mechanism by which SOMs could
contribute to surround suppression of layer 2/3 pyramidal cells
in vivo. Furthermore, under anaesthesia, a situation in which surround
suppression is compromised (see above and Supplementary Fig. 1), the
firing rate of SOMs was reduced tenfold (from 26 ± 2 Hz, n = 8 to
2.7 ± 0.4 Hz, n = 10), much more than that of single units or PVs
(Supplementary Fig. 1). This is consistent with a possible contribution
of SOMs to surround suppression. To test directly for the involvement
of SOMs in surround suppression, we conditionally expressed the
light-sensitive hyperpolarizing opsin archaerhodopsin (Arch) in V1
using viral injection of a flexed Arch vector into SOM-Cre mice
(71 ± 2% of cells infected (± s.e.m.), n = 4 animals; Fig. 4b and
Methods). Illumination of the cortical surface efficiently reduced the
visually evoked activity of Arch-expressing layer 2/3 SOMs (80 ± 1%
suppression, n = 4, P < 0.05; Supplementary Fig. 7). To determine the
impact of SOMs on size tuning in layer 2/3, we performed extracellular
recordings as described above and alternated control trials (visual
stimulus only) with trials in which SOMs were photo-hyperpolarized
(Fig. 4). Photo-hyperpolarization of SOMs significantly reduced
surround suppression of layer 2/3 neurons by 30 ± 10% (± s.e.m.; n = 28,
P < 0.00022, paired signed rank test, Fig. 4c-e; photo-hyperpolarization
of SOMs had no significant effect on baseline firing rates,
−9 ± 17% (± s.e.m.), n = 13, P > 0.18). Nearly all units (25 out of
28) showed a decrease in the suppression index (Fig. 4e), and in 10
out of 25 units the decrease was individually significant (P < 0.05,
permutation test). The reduction of the suppression index was a result
of SOM photo-hyperpolarization facilitating the response to large
visual stimuli more than to small visual stimuli: the response ratio
(the firing rate in the illumination condition divided by the
firing rate in the control condition) increased with the size of the
stimulus (Fig. 4f). Although the response to stimuli smaller than or equal
to the preferred size was not facilitated (−7 ± 7% (± s.e.m.), P > 0.45,
paired signed rank test, n = 28; FR_PS,ctrl = 4.9 ± 1.5 Hz, FR_PS,LED
= 4.6 ± 1.9 Hz, P > 0.63, paired signed rank test), the response to
stimuli larger than the preferred size was facilitated by 74 ± 19%
(P < 0.0011, paired signed rank test, n = 28; Fig. 4f). This lack of
facilitation of responses to smaller visual stimuli was not due to saturation
(that is, a ceiling effect). In fact, firing rates to stimuli smaller than or equal
to the preferred one were consistently facilitated less than similar firing
rates elicited by stimuli larger than the preferred one (Supplementary
Fig. 8). The stronger impact of SOM photo-hyperpolarization on
cortical responses to large stimuli is thus consistent with the preferential
activation of SOMs by large stimuli (Fig. 1c). Hence, by inhibiting layer
2/3 neurons as a function of stimulus size, SOMs generate an inhibitory
surround (Fig. 4g).
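
The response ratio used in this section can be illustrated with a short
Python sketch. Everything below is hypothetical: the function names, the
optional baseline subtraction (suggested by the Figure 4 legend but not
spelled out in the text) and the firing rates are assumptions chosen only to
show the arithmetic, not data from the study.

```python
def response_ratio(rate_light_hz, rate_control_hz, baseline_hz=0.0):
    """Firing rate with SOMs photo-hyperpolarized divided by the control rate.

    baseline_hz is subtracted from both rates; leave it at 0.0 for the raw ratio.
    """
    return (rate_light_hz - baseline_hz) / (rate_control_hz - baseline_hz)


def facilitation_percent(ratio):
    """Express a response ratio as a percentage change relative to control."""
    return 100.0 * (ratio - 1.0)


# Made-up example: stimulus sizes relative to the preferred size (1.0 = preferred).
relative_sizes = [0.5, 1.0, 2.0, 4.0]
control_hz = [3.0, 4.0, 2.0, 1.0]
light_hz = [3.0, 4.0, 3.0, 2.0]

ratios = [response_ratio(lit, ctrl) for lit, ctrl in zip(light_hz, control_hz)]
for size, ratio in zip(relative_sizes, ratios):
    print(f"size x{size}: ratio {ratio:.2f}, change {facilitation_percent(ratio):+.0f}%")
```

In this toy example the ratio stays at 1 up to the preferred size and rises
for larger stimuli, which is the qualitative pattern described above.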


Discussion 


This study describes a cortical circuit that significantly contributes to
surround suppression of layer 2/3 cells, and identifies a specific type of
inhibitory neuron, the SOM, as a key mediator of this phenomenon.
This circuit is therefore likely to be involved in the contextual
modulation of cortical responses to visual stimuli. The differential
recruitment of pyramidal cells in superficial layers by ascending inputs
and of SOMs by horizontal inputs underscores the fact that distinct
neuron types are differentially integrated in the excitatory cortical
circuit. These differences lead to different tuning properties, as
highlighted here by the distinct size-tuning curves. Thus, although small
stimuli efficiently drive layer 2/3 pyramidal cells through the activation
of ascending vertical inputs, by summing activity in space via
horizontal inputs, SOMs are preferentially driven by larger stimuli. As a
consequence, the larger the stimulus, the stronger the SOM-mediated
suppression of pyramidal cells.

An important question is how SOMs increase their firing as a
function of stimulus size if they suppress layer 2/3 pyramidal cells, that is,
their main source of excitation. It is likely that the number of pyramidal
cells recruited by the outer edge of the stimulus (an annulus that grows
linearly with the diameter of the stimulus) more than compensates for
the reduction in pyramidal-cell firing at the centre of the stimulus.
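
The geometric part of this argument can be checked with a few lines of
Python. The sketch below is purely illustrative and is not a model of
cortical activity: it assumes an annulus of fixed width at the outer edge of
a circular stimulus, and the function name, width and diameters (in
arbitrary units) are hypothetical.

```python
import math


def annulus_area(outer_diameter, width):
    """Area of a ring of the given width at the outer edge of a disc."""
    outer_radius = outer_diameter / 2.0
    inner_radius = outer_radius - width
    # pi * (R^2 - r^2) simplifies to pi * width * (outer_diameter - width),
    # which is linear in the outer diameter for a fixed width.
    return math.pi * (outer_radius**2 - inner_radius**2)


width = 5.0
diameters = [20.0, 40.0, 60.0, 80.0]
areas = [annulus_area(d, width) for d in diameters]

# Equal steps in diameter give equal steps in annulus area.
increments = [b - a for a, b in zip(areas, areas[1:])]
print(increments)
```

Under this assumption each equal increase in stimulus diameter adds the same
amount of newly recruited edge area, whereas the total stimulus area grows
with the square of the diameter.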

Photo-hyperpolarization of SOMs reduces but does not abolish
surround suppression. Although this may partly be due to incomplete
silencing of SOMs, surround suppression is probably also relayed to
cortical layer 2/3 by earlier stages of visual processing, and other
types of inhibitory neurons or circuits may also contribute to
surround suppression.

The preferential recruitment of SOMs by horizontal excitatory
projections is consistent with these projections having a role in size
tuning. In cortical layers with less extensive horizontal connectivity,
size tuning may rely on different mechanisms or may be entirely
inherited from pre-cortical areas. Importantly, because SOMs are
tuned to the orientation of visual stimuli, they could account for the
orientation dependence of surround suppression. Furthermore,
SOMs may respond differentially to specific stimulus properties, such
as contrast, and thus also contribute to the contrast dependence of
surround suppression. It is likely that a connectivity pattern
similar to what is described here may be present in other cortical areas as
well, and thus contribute to suppressive surround in several sensory
and non-sensory modalities.

Figure 4 | SOMs contribute to size tuning of layer 2/3 pyramidal cells.
a, Schematic of the experimental setup. b, Section of the visual cortex of a
SOM-CRE;Rosa-LSL-tdTomato mouse injected with AAV-flexed-Arch-GFP. All
SOM-CRE cells express tdTomato (red) and infected neurons also express
Arch-GFP (green). c, Size tuning of an isolated unit in control conditions and
during photo-hyperpolarization of SOMs. Dashed grey line, baseline firing rate.
Inset, response ratio for this example unit. All error bars are s.e.m. d, Average
peak-aligned and scaled size-tuning curves for 28 isolated single units
(3 animals, 7 recording sessions). Black, control conditions; orange, SOM
hyperpolarization. Inset, average peak-aligned and control-normalized
size-tuning curve in the SOM-hyperpolarization condition. Note the lack of
facilitation at the preferred or smaller size. All error bars are s.e.m. e, Scatter plot
showing SI under control conditions (x axis) plotted against SI under SOM
hyperpolarization (y axis) for each of the 28 units. Blue data points are units that
are size tuned at baseline (n = 24, SI > 0, P < 0.05). Grey data points are units
that are not size tuned at baseline. Stars are units that showed a significant
reduction in SI (n = 10, P < 0.05). Histograms beside x and y axes show SI
distribution under control and SOM hyperpolarization, respectively. The
oblique histogram illustrates the distribution of changes in SI with SOM
hyperpolarization. f, The ratio of the average response during SOM
photo-hyperpolarization to the average response under control conditions plotted
against stimulus size relative to peak. Same units as d. Inset, the same ratio
without subtracting the baseline firing rate. g, Schematic illustration of the cortical
circuit in layer 2/3 contributing to surround suppression. As a visual stimulus
expands (larger stimuli are shown in lighter grey), recruitment of adjacent PCs
increases SOM excitation through horizontal axons (horizontal arrows). Error
bars on response ratios (c (inset), f) are estimates of s.e.m. (see Supplementary
Methods for details).


METHODS SUMMARY 


Experiments were performed in accordance with the regulations of the
Institutional Animal Care and Use Committee of the University of California,
San Diego. Mice were heterozygous for SOM-IRES-CRE (Jackson Laboratory
stock no. 013044) or PV-CRE (no. 008069) (except for the mice used for the
experiments in Fig. 2a, b) and the reporter allele Rosa-LSL-tdTOMATO (Allen
Institute line Ai9, Jackson Laboratory no. 007905). For Fig. 2a, b, mice were
positive for Scnn1a-tg3-CRE (Jackson Laboratory no. 009613) and crossed with
the Gin (no. 003718) or B13 lines. For in vivo experiments, mice were implanted
with a custom head plate and habituated to head fixation while running on a
free-spinning circular treadmill. For targeted recording in vivo,
tdTomato-expressing neurons were visualized by two-photon microscopy and
contacted by a glass electrode containing Alexa Fluor 488. Extracellular unit
recording was carried out using 16-channel silicon probes (Neuronexus). Single
units were isolated using custom spike sorting software (Kleinfeld laboratory).
We conditionally expressed ChR2 by in utero electroporation (for layer 2/3) or
through a CRE-dependent adeno-associated virus (AAV) in Scnn1a-tg3-CRE
(for layer 4). Arch or eNpHR were expressed via CRE-dependent AAVs in
SOM- and PV-IRES-CRE mice. Visual stimuli were generated by custom software
(Psych Toolbox) and presented on a gamma-corrected liquid crystal display
monitor 15 cm from the mouse. Photostimulation in vivo was carried out using
fibre-coupled light-emitting diodes (LEDs; Doric Lenses). Photostimulation
in vitro was carried out using a combination of fibre-coupled LEDs, or LEDs
mounted and coupled to an epifluorescence microscope (Olympus BX51).
eNpHR was activated by a shuttered arc lamp. Slice preparation
and intracellular recording followed previous protocols. Data acquisition, visual
stimulation and statistical analysis were carried out using Igor Pro and Matlab.


Unexpectedly large mass loss during the thermal

The asymptotic-giant-branch star R Sculptoris is surrounded by a
detached shell of dust and gas. The shell originates from a thermal
pulse during which the star underwent a brief period of increased
mass loss. It has hitherto been impossible to constrain
observationally the timescales and mass-loss properties during and after
a thermal pulse, parameters that determine the lifetime of the
asymptotic giant branch and the amount of elements returned by
the star. Here we report observations of CO emission from the
circumstellar envelope and shell around R Sculptoris with an angular
resolution of 1.3″. What was previously thought to be only a thin,
spherical shell with a clumpy structure is revealed to also contain a
spiral structure. Spiral structures associated with circumstellar
envelopes have been previously seen, leading to the conclusion that
the systems must be binaries. Combining the observational data
with hydrodynamic simulations, we conclude that R Sculptoris is a
binary system that underwent a thermal pulse about 1,800 years ago,
lasting approximately 200 years. About 3 × 10^-3 solar masses of
material were ejected at a velocity of 14.3 km s^-1 and at a rate around
30 times higher than the pre-pulse mass-loss rate. This shows that
about three times more mass was returned to the interstellar medium
during and immediately after the pulse than previously thought.

The detached shell around R Sculptoris was observed in CO(J = 3-2)
emission at 345 GHz using the Atacama Large Millimeter/submillimeter
Array (ALMA) during Cycle 0 operations (Fig. 1 and Supplementary
Information). The data clearly show the well-centred detached shell
with a radius of 18.5″, and reveal a spiral structure extending from the
central star outwards to the shell. Previous observations of R Sculptoris
show structure in the form of clumps. However, this was interpreted as
clumpy material within the shell itself, and not as a structure interior to
the shell.

Until now no clear signs of binary companions have been observed
in any detached shell sources (with a possible exception for the
detached shell around TT Cygnus). The observed structure around
R Sculptoris, however, indicates the presence of a companion, shaping
the stellar wind into a spiral shell structure. Smoothed particle
hydrodynamics (SPH) models show that a wide binary companion can have
a significant effect in the shaping of the wind, leading to elliptical and
spiral structures (for example, as observed in the case of the envelope of
AFGL 3068).

The observed shapes of the circumstellar envelopes around binary
asymptotic-giant-branch stars depend on the physical parameters of
the binary system (such as separation and mass ratio), the density
contrasts imprinted on the wind, the temperatures in the circumstellar
envelope, the viewing angle, and the chemistry and excitation of the
gas. The temporal variations of the mass-loss rate and the expansion
velocity further affect the structure of the circumstellar envelope.
Hence, the observed spiral structure and detached shell allow us to
measure these important properties, and to link them directly to the
thermal pulse.

Any change in the expansion velocity of the stellar wind will affect 
the spacing between the spiral windings. In Fig. 2 the spiral can be 
followed from the central star out to the detached shell over about five 
windings. The 2.5 inner windings have a nearly constant spacing, with 
an average distance of 2.6”, implying an essentially constant expan- 
sion velocity during the last 2.5 orbital periods. The expansion velocity 
of the present-day wind’ of RSculptoris gives an orbital period of 
torp = 350 years. The position angle and radius of the observed emis- 
sion then allow us to derive the velocity evolution of the stellar wind 
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Figure 1 | ALMA Early Science observations of the CO(J = 3 — 2) emission 
from the asymptotic-giant-branch star R Sculptoris. The figure shows the 
emission in the different velocity channels. The colour scale gives the flux in Jy 
per beam. The stellar velocity with respect to the local standard of rest (LSR) is 
Visp = —19kms 1. The numbers in the top right corners indicate the velocity 
in kilometres per second with respect to the stellar velocity. The spherical 
detached shell appears as a ring in the individual velocity channels, with its 
largest extent at the stellar velocity. The shell is clearly visible at 18.5” at the 
stellar visp, as well as a spiral structure connecting the central star with the 
detached shell. The structure can be traced through all velocity channels. 
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Technology, SE-43992 Onsala, Sweden. Royal Observatory of Belgium, Ringlaan 3, 1180 Brussels, Belgium. University of Vienna, Department of Astrophysics, Turkenschanzstrasse 17, 1180 Wien, 
Figure 2 | The CO(J = 3-2) emission at the stellar v_LSR of R Sculptoris. 
a, Position angle versus radius based on the stellar v_LSR image. The position 
angle starts at due North and increases counter-clockwise. b, The stellar v_LSR 
image. The colour scale gives the flux in Jy per beam. The green lines show 
linear fits to the emission peaks (white circles) in a. The first 2.5 windings are 
nearly parallel, with a constant separation of 2.6″ ± 0.07″, indicating that the 
expansion velocity has been constant (on average) for the last 2.5 binary 
periods. The present-day expansion velocity is estimated to be 10.5 km s⁻¹, 
giving a binary period of 350 years. The linear fits can hence be translated 
directly into a velocity evolution (Fig. 3). The corresponding spiral is plotted on 
top of the stellar v_LSR image (b). Deviations from a perfect spiral are of the order 
of ±1.5 km s⁻¹, indicating small velocity variations over periods of 50 years. 
Partial spiral arms and arcs connecting the third and fourth windings show a 
larger variation in the wind velocity during these orbital periods on a timescale 
of about 100 years. 


from R Sculptoris from the star out to the detached shell (Fig. 3, and 
Supplementary Information). The derived evolution of the expansion 
velocity since the last thermal pulse is consistent with models of 
thermal pulses*. However, the observed emission implies variations 
in the expansion velocity of ±1.5 km s⁻¹ on timescales of a few hundred 
years. Observed partial spiral windings and arcs, as well as emission at 
velocities up to 19 km s⁻¹, indicate brief periods of even larger velocity 
variations. 
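
The relation used above between winding spacing, wind velocity and binary period can be sketched numerically. The snippet below is a minimal illustration, not the authors' analysis: it assumes a distance of 290 pc to R Sculptoris (a value not given in this text), uses the small-angle rule that 1″ at 1 pc subtends 1 AU, and all function names are hypothetical.

```python
# Sketch: convert spiral winding spacing into a binary period, and back.
AU_KM = 1.495978707e8   # kilometres per astronomical unit
YEAR_S = 3.15576e7      # seconds per Julian year
DISTANCE_PC = 290.0     # ASSUMED distance to R Sculptoris; not stated in the text


def arcsec_to_km(theta_arcsec, distance_pc=DISTANCE_PC):
    """Projected length in km; 1 arcsec at 1 pc corresponds to 1 AU."""
    return theta_arcsec * distance_pc * AU_KM


def orbital_period_yr(spacing_arcsec, v_exp_kms, distance_pc=DISTANCE_PC):
    """Time the wind needs to cross one winding spacing, i.e. one period."""
    return arcsec_to_km(spacing_arcsec, distance_pc) / v_exp_kms / YEAR_S


def expansion_velocity_kms(spacing_arcsec, period_yr, distance_pc=DISTANCE_PC):
    """Inverse relation: wind speed implied by a spacing at a fixed period."""
    return arcsec_to_km(spacing_arcsec, distance_pc) / (period_yr * YEAR_S)


period = orbital_period_yr(2.6, 10.5)          # spacing and velocity from Fig. 2
velocity = expansion_velocity_kms(2.6, 350.0)  # spacing at the quoted period
print(f"period ~ {period:.0f} yr, velocity at 350 yr ~ {velocity:.1f} km/s")
```

Under the assumed distance, the 2.6″ spacing and 10.5 km s⁻¹ wind give a period within a few per cent of the quoted 350 years. A wider or narrower spacing at fixed period maps directly onto a faster or slower wind, which is how the fits in Fig. 2a translate into the velocity evolution.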

A spherically symmetric, detached shell can still be created in a binary 
system where the asymptotic-giant-branch star is undergoing a thermal 
pulse, owing to the brief increase in mass-loss rate and expansion velo- 
city. Collision with the surrounding, slower material will then shape the 
wind into a symmetric shell structure. The post-pulse mass loss leaves 
behind a spiral structure that connects the detached shell with the 
central star. Assuming a spherically symmetric expanding detached 
shell gives a shell expansion velocity of v_sh = 14.3 km s⁻¹ and a shell 
radius of R_sh = 18.5″ (see Supplementary Information). 

The present expansion velocity and size of the shell around 
R Sculptoris place an upper limit of 1,800 years on the time since the end 
of the thermal pulse. With a binary period of 350 years we would expect 
to see around five windings since the pulse, consistent with the 
observed spiral. A decelerated shell would imply a shorter time since 
the thermal pulse, and hence a shorter binary period or fewer spiral 
Figure 3 | The velocity and mass-loss rate evolution of the stellar wind 
around R Sculptoris. The solid and dashed lines show the velocity and mass- 
loss rate as a function of time, respectively. The points correspond to the 
expansion velocities of the emission peaks in Fig. 2a, assuming a binary period 
of 350 years. The figure shows the evolution of the velocity and mass-loss rate 
since the onset of the last thermal pulse. The velocity profile is a fit to the data 
points, whereas the mass-loss rate profile is constrained by the pre-pulse, pulse 
and present-day mass-loss rates. The profiles are used as input to the SPH 
models. The shape of the mass-loss rate profile is chosen to be consistent with 
the observations. The overall velocity fits the predictions from theoretical 
models of thermal pulses well. However, velocity variations of ±1.5 km s⁻¹ are 
apparent throughout the evolution since the last pulse, whereas theoretical 
models only predict significant variations in the expansion velocity less than 
200 years after the pulse’. Theoretical models predict an increasing widening of 
the spiral proportional to the local sound speed, which may explain at least part 
of the velocity variation®. 


windings. The sweeping-up of material from the pre-pulse wind could 
cause such a deceleration of the shell. However, although a slight 
decrease in the expansion velocity of the detached shell is possible, 
we find no evidence of a noticeable decrease. This is contrary to the 
current theory of how detached shells are formed during thermal 
pulses**. Also, no material is likely to have piled onto the shell because 
of the post-pulse mass loss. 
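
The age argument in this paragraph is a kinematic estimate: shell radius divided by shell expansion velocity. The sketch below reproduces it under the same assumption as before, a distance of 290 pc that is not given in this text; the function name is hypothetical.

```python
# Sketch: kinematic age of the detached shell and expected number of windings.
AU_KM = 1.495978707e8   # kilometres per astronomical unit
YEAR_S = 3.15576e7      # seconds per Julian year
DISTANCE_PC = 290.0     # ASSUMED distance to R Sculptoris; not stated in the text


def kinematic_age_yr(radius_arcsec, v_shell_kms, distance_pc=DISTANCE_PC):
    """Age of a shell that expanded at constant speed to its present radius."""
    radius_km = radius_arcsec * distance_pc * AU_KM  # 1" at 1 pc = 1 AU
    return radius_km / v_shell_kms / YEAR_S


age = kinematic_age_yr(18.5, 14.3)   # shell radius and velocity from the text
windings = age / 350.0               # one spiral winding per binary period
print(f"age ~ {age:.0f} yr, expected windings ~ {windings:.1f}")
```

Because a decelerating shell would have moved faster in the past, the constant-velocity age is an upper limit, which is why deceleration would imply fewer windings or a shorter period.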

Recent images of thermal dust emission show the detached shell, as 
well as a more distant, and spatially distinct, region of interaction with 
the interstellar material’’, indicating the presence of a stellar wind from 
R Sculptoris before the pulse. Collision with a previous, slower wind is 
required to prevent the thin shell from quickly diffusing*’*. The average 
CO line intensity in the area surrounding the shell sets an upper limit to 
the pre-pulse mass-loss rate, and its ratio to the average CO line inten- 
sity in the shell suggests an increase in mass-loss rate during the thermal 
pulse by a factor of more than ten. The total shell mass is essentially only 
due to the mass lost during the formation of the shell. An estimated total 
gas mass in the shell of 2.5 × 10⁻³ solar masses (ref. 3) gives a thermal 
pulse mass-loss rate of between 7 × 10⁻⁶ solar masses per year and 
2.5 × 10⁻⁵ solar masses per year (see Supplementary Information), 
and implies a pre-pulse mass-loss rate of less than 10⁻⁶ solar masses 
per year. The present-day mass-loss rate is estimated to be 3 × 10⁻⁷ 
solar masses per year (ref. 3), that is, a factor of about 30 lower than 
during the pulse. This general evolution of the mass-loss rate is con- 
sistent with stellar evolution models; however, the ratio between the 
derived pulse and pre-pulse mass-loss-rate is significantly higher than 
found in the models*. 
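
The mass-loss figures in this paragraph can be cross-checked with simple arithmetic. The sketch below assumes the quoted values read as a shell mass of 2.5 × 10⁻³ solar masses, pulse rates of 7 × 10⁻⁶ to 2.5 × 10⁻⁵ solar masses per year and a present-day rate of 3 × 10⁻⁷ solar masses per year; the exponents are a reading of the damaged text, and the helper name is hypothetical.

```python
# Sketch: consistency check of the quoted shell mass and mass-loss rates.
SHELL_MASS = 2.5e-3        # solar masses (assumed reading of the text)
PULSE_RATE_LOW = 7e-6      # solar masses per year (assumed reading)
PULSE_RATE_HIGH = 2.5e-5   # solar masses per year (assumed reading)
PRESENT_RATE = 3e-7        # solar masses per year


def formation_time_yr(mass, rate):
    """Time needed to eject `mass` at a constant mass-loss `rate`."""
    return mass / rate


shortest = formation_time_yr(SHELL_MASS, PULSE_RATE_HIGH)
longest = formation_time_yr(SHELL_MASS, PULSE_RATE_LOW)
ratio_low = PULSE_RATE_LOW / PRESENT_RATE
ratio_high = PULSE_RATE_HIGH / PRESENT_RATE
print(f"shell formation time: {shortest:.0f} to {longest:.0f} yr")
print(f"pulse / present-day rate: {ratio_low:.0f} to {ratio_high:.0f}")
```

With these readings, the quoted factor of about 30 between pulse and present-day rates falls inside the computed range, and the shell would have formed over a few hundred years at most.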

To further constrain the mass-loss-rate evolution of R Sculptoris, we 
modelled the system with a modified version of the GADGET-2 SPH 
code™, including detailed radiative cooling’. The modelled system 
successfully forms a detached shell, including the observed spiral struc- 
ture (see Supplementary Video). The modelled density, temperature 
and velocity structures of the SPH model are then used as input in the 
three-dimensional radiative transfer code LIME”*. The global morpho- 
logy of the modelled system closely resembles that of the observations, 
and the brightness distribution reproduces the observed intensities 
well (Fig. 4). We effectively constrain the mass-loss-rate evolution 
Figure 4 | LIME radiative transfer model of the circumstellar structure 
around R Sculptoris. The model is based on the results from the SPH models 
at the stellar velocity. The results of the radiative transfer model have been 
processed by the ‘simdata’ task in CASA (Common Astronomy Software 
Applications) using the ALMA Cycle 0 compact configuration specifications. 
The colour scale gives the flux in Jy per beam. The overall model intensities 
match the observed intensities well, while variations in the intensity contrast 
between the spiral windings and interwinding material indicate a more 
complicated mass-loss rate variation. The inclination angle of the binary system 
to the line of sight is 90°. See Supplementary Information for an animation of 
the SPH models of R Sculptoris. 


throughout the thermal pulse to the present time (Fig. 3). Assuming an 
inter-pulse time of 50,000 years (typical for stars of about 1-4 solar 
masses; ref. 17) and our derived mass-loss rate evolution, 10% of the 
mass lost between two subsequent pulses is expelled during the ther- 
mal pulse, and 40% is lost during the first 1,800 years after the pulse. 
The pulse and immediate post-pulse phases are therefore dominant in 
the formation and chemical enrichment of the circumstellar envelope. 

The chemical content of the expelled material depends critically on 
the physical properties of the pulses (for example, pulse duration and 
interpulse mass-loss rate). The duration of the pulse limits the time for 
nucleosynthesis to occur inside the star'*, whereas the mass-loss rate 
between pulses limits the number of thermal pulses an asymptotic- 
giant-branch star will experience’’. These properties will affect the 
stellar yields of new elements returned to the interstellar medium, as 
well as eventually lead to the termination of the asymptotic-giant- 
branch phase. The observations presented here directly constrain these 
important physical parameters throughout the thermal-pulse cycle. In 
essence, it is the observed spiral structure that allows us to verify model 
results observationally, and refine our knowledge of thermal pulses 
and late stellar evolution. 
Grain boundaries in graphene are formed by the joining of islands 
during the initial growth stage, and these boundaries govern trans- 
port properties and related device performance’”. Although informa- 
tion on the atomic rearrangement at graphene grain boundaries can 
be obtained using transmission electron microscopy** and scan- 
ning tunnelling microscopy~**, large-scale information regarding 
the distribution of graphene grain boundaries is not easily access- 
ible. Here we use optical microscopy to observe the grain boundaries 
of large-area graphene (grown on copper foil) directly, without 
transfer of the graphene. This imaging technique was realized by 
selectively oxidizing the underlying copper foil through graphene 
grain boundaries functionalized with O and OH radicals generated 
by ultraviolet irradiation under moisture-rich ambient conditions: 
selective diffusion of oxygen radicals through OH-functionalized 
defect sites was demonstrated by density functional calcula- 
tions. The sheet resistance of large-area graphene decreased as the 
graphene grain sizes increased, but no strong correlation with the 
grain size of the copper was revealed, in contrast to a previous 
report’. Furthermore, the influence of graphene grain boundaries 
on crack propagation (initialized by bending) and termination was 
clearly visualized using our technique. Our approach can be used 
as a simple protocol for evaluating the grain boundaries of other 
Figure 1 | Observation of graphene grain boundaries (GGBs) after 
ultraviolet exposure under moisture-rich ambient conditions. a, Diagram of 
the ultraviolet treatment of a graphene/Cu sample. The copper under the GGBs 
was oxidized by radicals; the lines of oxidized copper were broadened during 
continuing oxidation and thereby became visible using an optical microscope. 


two-dimensional layered structures, such as boron nitride and 
exfoliated clays. 

Optical microscopy is an important tool for characterizing graphene 
at large scales. Although graphene is only one atom thick, a given area 
can have more than one layer growing on it; areas with differing num- 
bers of layers can be distinguished by their contrast difference’®. Optical 
birefringence from transferred graphene covered by a liquid crystal has 
been used to visualize graphene grain boundaries (GGBs), but the grain 
boundaries of the copper substrate were visualized, and looked like 
GGBs’. This last observation contradicts previous reports based on 
scanning electron microscopy (SEM) in which GGBs and copper grain 
boundaries were found to be independent"’. Direct characterization of 
GGBs on copper foil is required. Our work examines graphene grown 
directly on copper without the need for a transfer process, which often 
leads to undesired artefacts, such as additional traces from copper 
substrates, wrinkles and/or cracks’’. 

The primary objective of this Letter is to visualize grain boundaries 
in large-area graphene, and distinguish them from grain boundaries 
of the copper substrate, using an optical microscope. This approach is 
based on the robust oxidization of copper foil at room temperature 
via the selective diffusion of O and OH radicals through the GGBs 
(Fig. 1a). These radicals are generated using ultraviolet irradiation 


b, c, Optical images of graphene/Cu before (b) and after (c) oxidation. The 

GGBs were visible after oxidation. d, SEM image of oxidized graphene/Cu. The 
GGBs (coincident with oxidized copper seen as white dotted lines) intersected 
the Cu grain boundary (between the dark central region and the surrounding 
grey region). e, AFM force image of the position marked by a red square in c. 
under moisture-rich ambient conditions. The GGBs are very narrow, 
but the width of the oxidized copper foil beneath the defective GGBs is 
increased by the continuous supply of oxidizing radicals through the 
grain boundary. In addition, oxidized copper occupies a larger volume 
than copper, thereby allowing the visualization of the GGBs using an 
optical microscope (Fig. 1a). 

In general, GGBs are not visible using optical microscopy because 
of the nanoscale width of the boundaries**. Although copper grain 
boundaries were clearly observed in the optical microscopy images 
because of their large width, GGBs were not visible (Fig. 1b). The 
CVD-grown large-area graphene on the Cu foil was used directly 
without transfer in our study (see Methods). After the graphene/Cu 
sample was irradiated by ultraviolet light under moisture-rich ambient 
conditions, the GGBs were visible as thin dark lines. Some of the GGB 
lines crossed over the Cu grain boundaries, demonstrating the forma- 
tion of GGBs regardless of the location of the Cu grain boundaries, 
which is consistent with previous investigations using SEM"’. Similar 
GGBs were also observed as white lines in scanning electron micro- 
scopy (SEM) images (Fig. 1d). The copper grain boundaries were 
clearly distinguishable from the GGBs on the basis of their channelling 
contrast levels (that is, different orientations of the copper crystals 
result in different brightnesses; Supplementary Fig. 1). It is interesting 
to note the different thicknesses of the GGB lines, which implies that 
the nature of the defects in GGBs is complicated. Higher oxygen con- 
tent was observed at the GGBs than within the graphene grains 
(Supplementary Fig. 2). The GGBs in the optical image also matched 
well with those present in the non-contact force image obtained using 
tapping-mode atomic force microscopy (AFM; Fig. 1e). 

The GGBs visible in the optical image (Fig. 2a) were also confirmed 
using confocal Raman mapping. The CVD-grown graphene/Cu exhib- 
ited a large intensity difference between the G’ and G bands (a high G’/ 
G ratio) with no appreciable D-band (Fig. 2b), thereby demonstrating 
a high-quality monolayer of graphene. With ultraviolet treatment, the 
D-band developed both at the grain boundary and within the grain, 
but occurred more strongly at the grain boundary. This developed 
D-band intensity was the result of the increased oxygen content, which 
increased from 2.8 to 14.2 at.%, as confirmed using X-ray photoelec- 
tron spectroscopy analysis (Supplementary Fig. 3). The blueshift of the 


[Figure 2 panels: a, optical image; b, Raman spectra, intensity (a.u.); c, D-band intensity map; scale bars, 3 µm.]


Figure 2 | Confocal Raman mapping of GGBs. a, b, Optical image (a) and 
Raman spectra (b) of graphene/Cu. The blueshift of the G- and G’-bands in 
addition to the development of the D-band within the grain provide evidence of 
graphene oxidation. The redshift of the G- and G’-band peaks on the grain 
G-band (1,590 to 1,597 cm⁻¹) and the G’-band (2,700 to 2,717 cm⁻¹) 
in addition to the D-band development inside the grain after ultraviolet 
treatment is indicative of graphene oxidation'*”’. Interestingly, 
at the grain boundary, both the G-band and the G’-band peaks were 
redshifted compared with those within the grain, respectively from 
1,597 to 1,594 cm⁻¹ and from 2,717 to 2,688 cm⁻¹. This redshift may 
be explained as follows: in addition to the oxidation of the pristine 
graphene at the grain boundary, the oxidized copper formed under 
the GGB has a greater volume than the pristine copper, and this 
increased volume forced the uppermost graphene to experience a 
tensile strain'*’®. The mappings in D-band, G-band and G’-band 
(Fig. 2c-e) matched well with the GGBs observed using optical micro- 
scopy. We emphasize here that GGBs cannot be observed using con- 
focal Raman mapping of graphene unless prior ultraviolet treatment 
has been applied (Supplementary Fig. 4). 

The observation of GGBs using optical microscopy after ultraviolet 
treatment is possible owing to the morphological changes in the sample. 
Three types of line profile were detected using topological AFM before 
and after ultraviolet irradiation (Fig. 3a): (1) the GGBs; (2) a wrinkle 
caused by a GGB; and (3) a Cu grain boundary. The step-like GGBs 
(with a step height of 2.5 nm) were visualized more clearly after ultra- 
violet treatment (Fig. 3c top), which enhanced the step height to 
approximately 15 nm because of the expanded volume of oxidized Cu 
at the boundary. The width of the oxidized boundary increased to 
approximately 500-600 nm, which is sufficiently large to be observed 
using an optical microscope. The second type of GGB line overlapped 
the wrinkle line and was approximately 5 nm high’*'’. This observation 
is consistent with a previous theoretical report that predicted wrinkles 
might be formed from GGBs"*. After ultraviolet irradiation, the GGB 
was distinguishable because of its increased height (Fig. 3c middle). This 
effect is similar to the previous example but the wrinkle line remained 
unchanged. On the other hand, the Cu grain boundary was initially 
grooved to a depth of 15 nm, and this depth was not altered by oxidation 
(Fig. 3c bottom). This observation supports previous reports of growing 
graphene sliding over the Cu grain boundary without creating defects" 
but contrasts with a report of strongly correlated GGBs and Cu grain 
boundaries’. Our results provide further evidence of the clear distinc- 
tion between GGBs and Cu grain boundaries. Further analysis of the 
boundary compared with those within the grain was caused by the tensile strain 
developed in the graphene owing to the expanded underlying oxidized copper. 
c-e, Maps of the intensities of the D-band (c) and G-band (d), and the shift of 
the G’-band (e). 
Figure 3 | Height profiles of various topological GGBs, and the oxidation 
mechanism. a, b, Topological images of various GGBs; (1) GGB, (2) a wrinkle 
caused by a GGB, and (3) a Cu grain boundary. c, Height profiles of the selected 
positions 1-3. d, e, Optical images of graphene/Cu during early growth (for 5 s) 
before (d) and after (e) oxidation. Three distinct regions are identified: (I) a 


morphology of oxidized copper and graphene after ultraviolet irra- 
diation is shown in Supplementary Figs 5 and 6. 

A GGB is typically formed by the joining of adjacent graphene 
islands that were initiated by nucleation seeds. To determine the rele- 
vance of our approach to observing the initial growth stage of graphene 
islands, graphene was grown for only 5s using CVD conditions similar 
to those described in Methods. Three distinct regions near a Cu grain 
boundary line were observed under these conditions (Fig. 3d): a typical 
graphene island (feature I), a highly defective graphene region (feature 
II), and Cu foil with incomplete graphene growth (feature III). The 
black spot in the middle of the grain (Fig. 3d) is a nucleation seed that 
has also previously been observed’’. The graphene/Cu shown in Fig. 3d 
was then subjected to ultraviolet irradiation under humid conditions; 
in Fig. 3e we show the area corresponding to Fig. 3d. It is interesting to 
note the occurrence of divergent grain boundary lines within the island 
centred at the nucleation seed, which is in good agreement with pre- 
vious confocal Raman mapping and TEM observations®"”. The highly 
defective graphene region and the graphene incompletely covering the 
Cu foil were readily oxidized and converted into black spots by the 
facilitation of radical diffusion. Thus, microstructures developed dur- 
ing the initial growth stage can be clearly observed using a simple 
optical microscope. 

The selective oxidation of Cu foil through the GGBs is dependent on 
the oxidation conditions. For example, the humidity level during ultra- 
violet irradiation is crucial in developing clear oxidation patterns. At 
low humidity levels (less than 25%), the grain boundaries were only 
partially visible (Supplementary Fig. 7a-d). Even when the oxidation 
time was prolonged to 30 min at these low humidity levels, GGBs were 
not observed. Pure ozone could not oxidize the graphene or the copper 
substrate at room temperature, as reported previously’*”*. The presence 
of H₂O was necessary to create OH radicals under ultraviolet 
irradiation’’”’; these radicals are necessary to functionalize graphene 


typical graphene island, (II) a highly defective graphene region, and (III) Cu foil 
with incomplete graphene growth. f, Density functional calculations for 
diffusion of radicals through a heptagon of OH-functionalized Stone-Wales 
defects (SW). Stone-Wales defects were enlarged and deactivated by the 
adsorbed radicals. 


defects to further facilitate copper oxidation (see simulations in 
Methods). At a humidity level of 25%, the black spots began to appear. 
These black spots, which may be impurities introduced during 
the preparation of the Cu foil, acted as nucleation seeds'*. The black 
spots could also be attributable to defects in the graphene. At higher 
humidity levels, the size of the black spots became larger, and the GGB 
lines were clearly visible. However, the size of the black spots and the 
GGB lines did not increase at a greater humidity level (more than 66%) 
or with longer oxidation times (Supplementary Fig. 5e-h). It is impor- 
tant to note that in addition to the oxidation of the graphene (by 
forming epoxide, hydroxyl and carboxyl groups particularly at the defect 
sites), the copper foil was also oxidized to give exclusively Cu(OH)₂ 
(Supplementary Fig. 8). 

We performed density functional calculations to understand the 
oxidation mechanism associated with GGB defects (Fig. 3f, Supplemen- 
tary Figs 9 and 10, Supplementary Table 1). The Stone-Wales defect 
experimentally observed at GGBs** was modelled. The height of the 
barrier experienced by species diffusing through the graphene was 
consistently reduced at the functionalized Stone-Wales defects, in 
particular, those functionalized by OH. It was interesting to see that 
the O radical had a smaller diffusion barrier height than the OH radical. 
In other words, the role of OH radicals was to reduce the diffusion 
barrier height by functionalizing defects. Also, the absolute values of 
the diffusion barrier height could be lowered by taking into account 
effects due to the substrate and excited states (see Methods). 

Although graphene was oxidized and strained by underlying oxidized 
copper, it showed no cracks (in optical micrographs or SEM images) 
after transfer (Supplementary Figs 11 and 12). The sheet resistance of 
transferred graphene also recovered to 426 ohms per square (Ω □⁻¹) 
after heat treatment at 600 °C in vacuum for 4 h, which was comparable 
to the 400 Ω □⁻¹ of the pristine graphene (PMMA removed; see 
Methods for details). The recovery of the electrical properties of 
graphene is often useful for studies of GGB-related intrinsic characteristics 
of graphene. 

To provide proof of concept and to demonstrate the power of our 
approach, several graphene samples were prepared using different 
growth temperatures during CVD. For each sample, the Cu foil was 
annealed for 30 min, followed by 5 min of graphene growth at a given 
temperature. The range of graphene grain sizes was estimated to be of 
the order of micrometres on the basis of TEM examination (Sup- 
plementary Fig. 13). The samples then received an ultraviolet treat- 
ment time identical to that described in the experimental section in 
Methods (10 min). We found that the above-mentioned black spots and 
GGB lines occurred regardless of the growth temperature (Fig. 4a—d). 
The number of graphene grains in an area 57 µm × 76 µm was counted 
to obtain an average grain size per unit area. As the growth temperature 
increased, the sheet resistance of the graphene gradually decreased to 
~300 Ω □⁻¹ for a growth temperature of 1,060 °C, and the average 
graphene grain size increased to ~72 µm² (Fig. 4e). The copper grain 
size fluctuated within the range of 10,000-15,000 µm²; however, a 
correlation between copper grain size and graphene growth temperature 
was not observed. The sheet resistance of the graphene decreased as its 
grain size increased. Other growth conditions with different annealing 
temperatures (while maintaining the same growth temperatures) 
were also evaluated but resulted in similar behaviours (Supplemen- 
tary Fig. 14). Thus, increasing the graphene grain size is a key factor in 
improving the graphene quality, whereas changing the Cu grain size is 
not effective in altering the sheet resistance of graphene. 
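
The grain-size estimate described above amounts to dividing the imaged area by the number of grains counted in it. A minimal sketch follows; the count of 60 grains is a hypothetical value chosen only because it lands near the quoted ~72 µm², and the function name is not from the paper.

```python
# Sketch: average grain area from a grain count in a fixed field of view.
FIELD_WIDTH_UM = 57.0    # field of view from the text
FIELD_HEIGHT_UM = 76.0


def mean_grain_area_um2(width_um, height_um, n_grains):
    """Average area per grain, assuming grains tile the whole field."""
    if n_grains <= 0:
        raise ValueError("n_grains must be positive")
    return width_um * height_um / n_grains


field_area = FIELD_WIDTH_UM * FIELD_HEIGHT_UM
example = mean_grain_area_um2(FIELD_WIDTH_UM, FIELD_HEIGHT_UM, 60)  # hypothetical count
print(f"field area = {field_area:.0f} um^2, mean grain area = {example:.1f} um^2")
```

Grains cut by the image edge are ignored here; a careful count would need a convention for them.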

Conductance AFM measurements showed that different GGBs have 
different conductances, and these GGBs were found to be consistent 
with the morphological GGBs observed after ultraviolet exposure 
(Fig. 4g-i and Supplementary Figs 15 and 16). The sheet resistance 
can be fitted to $\Omega = \Omega_0[1 + (A/A_0)^{-n}]$, where $\Omega_0$ is the sheet resistance 
of the intrinsic graphene without grain boundary scattering (or an 
infinite grain size), $A$ is the average grain size, $A_0$ is a fitting parameter, 
and $n$ is an exponent. We obtained a value of $\Omega_0$ (230 Ω □⁻¹), which 
to our knowledge has not been estimated previously. The theoretical 
limit of $\Omega_0$ is 30 Ω □⁻¹ (refs 23, 24). In our case, the value is larger 
owing to scattering from defects and the substrate within the grain. 
This implies that minimizing defects—such as point defects, wrinkles 
and ripples—and substrate scattering will be important for improving 
conductivity, in addition to enlarging the graphene grain size. 
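
To make the fitted form concrete, the sketch below evaluates Ω = Ω₀[1 + (A/A₀)^(−n)] and recovers A₀ and n by a log-linear least-squares fit with Ω₀ held fixed at the quoted 230 Ω □⁻¹. This is one possible procedure and not necessarily the authors' (their fitting method is not described here). The data are synthetic, and A₀ = 20 µm² and n = 1 are hypothetical values chosen for illustration.

```python
import math

# Sketch: grain-size dependence of sheet resistance and a simple fit.


def sheet_resistance(area, omega0, a0, n):
    """Omega = Omega0 * [1 + (A / A0)**(-n)]; tends to Omega0 for large A."""
    return omega0 * (1.0 + (area / a0) ** (-n))


def fit_a0_n(areas, resistances, omega0):
    """Fit A0 and n with Omega0 fixed, using ln(Omega/Omega0 - 1) = -n ln A + n ln A0.

    Every resistance must exceed omega0, otherwise the logarithm is undefined.
    """
    xs = [math.log(a) for a in areas]
    ys = [math.log(r / omega0 - 1.0) for r in resistances]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    n = -slope
    a0 = math.exp(intercept / n)
    return a0, n


OMEGA0 = 230.0                      # ohms per square, value quoted in the text
areas = [10.0, 20.0, 40.0, 72.0]    # um^2, synthetic grain sizes
data = [sheet_resistance(a, OMEGA0, 20.0, 1.0) for a in areas]  # hypothetical A0, n
a0_fit, n_fit = fit_a0_n(areas, data, OMEGA0)
print(f"A0 = {a0_fit:.2f} um^2, n = {n_fit:.2f}")
```

The negative exponent is what makes the sheet resistance fall towards Ω₀ as the grains grow, in line with the definition of Ω₀ as the infinite-grain-size limit.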

To demonstrate our method in a more advanced application, we 
visualized fracture propagation on bending, using optical microscopy 
(Fig. 4j-m). Fractures propagated preferentially normal to the strain 
direction and terminated at GGBs (white squares; Fig. 4k). As the 
radius of curvature decreased, more cracks were created and propa- 
gated through GGBs, and in some cases, propagation directions were 
altered at the GGB lines (Fig. 4l, m). Our method is not limited to 
graphene, and can be generalized to analyse the defects and grain size 
Figure 4 | The correlation between GGBs and sheet resistance. 

a-d, Oxidized graphene/Cu grown at indicated temperatures from 1,000 to 
1,060 °C. e, f, Graphene sheet resistance versus graphene grain size for different 
growth conditions. Error bars, s.d. (n = 4). In f, a fitted curve for the sheet 
resistance is arrowed; $\Omega = \Omega_0[1 + (A/A_0)^{-n}]$ (see text for details). 

g-i, Conductance AFM of graphene/Cu before oxidation, showing two 
different types of GGBs. In i, line scans across small (big) grain boundaries are 
indicated by blue (green) curves. j-m, Fracture propagation in graphene on Cu 
as a function of radius of curvature (d/2, see inset in j). Black dots that appeared 
after oxidation show the position of cracks (red arrows). GGBs are indicated by 
blue arrows. In k and m, the left inset is shown magnified in the right inset. GB, 
grain boundary. Fractures propagated normal to the strain direction, 
terminated (k, l) and changed direction at the GGBs (m). 
distribution of other two-dimensional layered structures, such as 
boron nitride (Supplementary Figs 17 and 18) and exfoliated clays. 


METHODS SUMMARY 

Ultraviolet oxidization of graphene on copper. Graphene on copper foil was 
placed into a chamber equipped with a low-pressure Hg lamp (LH-arc, Lichtzen, 
with an output of 20 mW cm⁻², with the majority of emitted light at a wavelength 
of 254 nm and approximately 10% of light at a wavelength of 185 nm)'*. Humidity 
was introduced into the chamber by connecting it to a water bubbler. The humidity 
level in the chamber was monitored using a hygro-thermometer (accuracy of 
±3%). The chamber was continuously ventilated throughout the experiments to 
maintain a constant pressure. After reaching the required humidity level, the water 
bubbler was disconnected from the chamber. The graphene/Cu substrate was then 
irradiated with ultraviolet light for 10 min to oxidize the samples. Humid air 
undergoes the following reactions under ultraviolet irradiation’: reactions (1), 
(2) and (4) require ultraviolet light to proceed, reaction (3) is thermally activated. 


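$$\mathrm{O_2 \rightarrow O_3} \qquad (1)$$

$$\mathrm{O_3 + H_2O \rightarrow O_2 + H_2O_2} \qquad (2)$$

$$\mathrm{2\,O_3 + H_2O_2 \rightarrow 2\,OH^{\bullet} + 3\,O_2} \qquad (3)$$

$$\mathrm{H_2O \rightarrow H^{\bullet} + OH^{\bullet}} \qquad (4)$$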


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


Synthesis of graphene on copper. Graphene was synthesized on 70-µm-thick 
copper foil (Wacopa) using atmospheric pressure chemical vapour deposition 
(APCVD). The temperature in the chamber was elevated to the growth temperature 
within 40 min, and the samples were annealed at the growth temperature for 
30 min with gas flows of 200 cubic centimetres at STP per minute (cm³ STP min⁻¹) 
of H₂ and 1,000 cm³ STP min⁻¹ of Ar. Monolayer graphene was synthesized 
by flowing 5 cm³ STP min⁻¹ of H₂ and 10 cm³ STP min⁻¹ of CH₄ with 
1,000 cm³ STP min⁻¹ of Ar for 5 min. The sample was then cooled to room 
temperature while maintaining 1,000 cm³ STP min⁻¹ of Ar. 

Synthesis of graphene on nickel. The Ni substrate of 300 nm thickness was 
deposited on a SiO₂/Si substrate by an electron-beam evaporator. This substrate 
was placed in a rapid thermal chemical deposition chamber. The temperature was 
increased from room temperature to 950 °C over 5 min with H₂ gas flow. To 
synthesize few-layer graphene, a mixed gas of C₂H₂/H₂ (2/45) and a growth time 
of 1 min were used. After growth, the gas supply was turned off, and the chamber 
was cooled down to 150 °C at a cooling rate of 160 °C min⁻¹. Around ten layers of 
graphene were synthesized. 

Optical microscopy and Raman spectroscopy. Optical microscopy (×100 
magnification, Olympus, numerical aperture = 0.9) was used to obtain images of the 
surface morphologies of the graphene/Cu samples. Two-dimensional confocal 
Raman mapping (CRM 200, Witec) was also performed using a doubled 
Nd:YAG laser (532 nm) with 1 mW power to confirm the optical image results. 
The scan image was obtained at 100 × 100 pixels with a grating of 600 grooves per 
mm to yield a spectral resolution of 5 cm⁻¹. The accumulation time for each 
spectrum was 0.3 s for image scanning and 30 s for a single spectrum. A sum filter 
was used to extract the D-band (1,300-1,400 cm⁻¹), G-band (1,540-1,640 cm⁻¹) 
and G’-band (2,640-2,740 cm⁻¹) distributions after copper background 
subtraction was performed. 
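
The sum filter mentioned above can be pictured as adding up the intensities that fall inside each band window. The sketch below is an illustration only: it uses the window limits quoted in the text, assumes the copper background has already been subtracted, and runs on a synthetic two-peak spectrum whose peak positions, widths and heights are hypothetical. The function names are not taken from any instrument software.

```python
# Sketch: sum-filter extraction of D-, G- and G'-band intensities.
BANDS = {
    "D": (1300.0, 1400.0),    # window limits in cm^-1, from the text
    "G": (1540.0, 1640.0),
    "G'": (2640.0, 2740.0),
}


def band_sums(wavenumbers, intensities, bands=BANDS):
    """Sum background-subtracted intensities inside each band window."""
    if len(wavenumbers) != len(intensities):
        raise ValueError("wavenumbers and intensities must have equal length")
    totals = {name: 0.0 for name in bands}
    for w, value in zip(wavenumbers, intensities):
        for name, (low, high) in bands.items():
            if low <= w <= high:
                totals[name] += value
    return totals


def lorentzian(x, centre, half_width, height):
    """Lorentzian line shape used to build the synthetic spectrum."""
    return height / (1.0 + ((x - centre) / half_width) ** 2)


# Synthetic spectrum sampled every 5 cm^-1 from 1,200 to 2,900 cm^-1.
wavenumbers = [1200.0 + 5.0 * k for k in range(341)]
spectrum = [
    lorentzian(w, 1590.0, 10.0, 400.0) + lorentzian(w, 2700.0, 15.0, 800.0)
    for w in wavenumbers
]
sums = band_sums(wavenumbers, spectrum)
print({name: round(total, 1) for name, total in sums.items()})
```

Applying the same function to every pixel of a 100 × 100 scan would give one intensity map per band, which is the kind of output shown in the Raman maps.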

Atomic force microscopy (AFM). The AFM images were obtained using a SPA400 
system (Seiko) in tapping mode. A NSC14-type silicon tip (MikroMasch) with an 
approximately 10-nm tip radius was used. In general, the force constant and 
resonant frequencies of the tips were approximately 5 N m⁻¹ and 160 kHz, 
respectively. The conductance and morphology mappings of graphene monolayer 
on a Cu substrate were carried out using a Raman-AFM system from NT-MDT. 
The system was run in contact mode for both mappings. We used an Au-coated tip 
that had a radius of curvature of approximately 10 nm and an electrical resistivity 
of 0.025 Ω cm. The force applied to the AFM tip was about 3.2 tN, which was
under precise control through a feedback loop during the scanning. For the current 
measurement, we applied 4 V d.c. to the Cu substrate and grounded the tip. The 
scan speed was 4.3 µm s⁻¹. The morphology and electrical current were obtained
simultaneously at each pixel. The resolution of all the images was 512 × 512 d.p.i.
During the characterization, room temperature and humidity level were 21 °C and 
50%, respectively. 

Scanning electron microscopy (SEM). A field-emission scanning electron micro- 
scope (FESEM; JSM7000F, Jeol) was used to examine the surface morphology of 
the samples at different accelerating voltages to obtain a high level of contrast at 
different magnifications. An X-Max silicon drift detector was used for energy 
dispersive spectroscopy mapping for a duration of one hour in a FESEM- 
JSM7600F system. An accelerating voltage of 3 keV was used to obtain sufficiently 
high signals while retaining the sensitivity to the sample surface. 

X-ray photoelectron spectroscopy (XPS). XPS was performed using an Al Kα
X-ray source (XPS, ESCA2000, VG Microtech). C 1s, Cu 2p and O 1s peak data were
collected to analyse the extent of oxidation of the graphene and the underlying 
Cu foil. 

Transmission electron microscopy (TEM). The graphene was transferred to a 
carbon grid for TEM using two steps: the graphene was first transferred from the 
copper foil to a silicon substrate using the poly(methyl methacrylate) (PMMA)- 
supported layer method and then transferred from the silicon substrate to a TEM 
grid using the direct transfer method. PMMA liquid (MicroChem, 950 PMMA
C4) was spin-coated onto graphene/Cu at 1,000 r.p.m. for 60 s. The copper foil was
then etched away using an FeCl₃ solution for 40 min. The PMMA/graphene was
then placed onto SiO₂ after rinsing with deionized water, and the PMMA was
removed using acetone. Isopropanol was then dropped on the graphene/SiO₂
surface, and the TEM grid was attached onto the graphene/SiO₂. After drying,
the graphene strongly adhered to the TEM grid. A diluted HF solution was
then used to slowly etch the SiO₂ and leave the TEM/graphene floating on the
solution surface. After rinsing off HF with deionized water, the graphene/TEM 
grid was dried for 5 min before collecting TEM images. A TEM grid with a carbon- 
supported thin film (PELCO, 200 mesh, carbon type B) was used to collect select- 
ive area electron diffraction (SAED) patterns. The use of the film-type TEM grid 
was necessary to reduce the possibility of breaking the monolayer graphene. The
SAED pattern was collected at 1.2 µm, that is, the maximum aperture size, using an
HR-TEM instrument (JEM2100F, Jeol) at 200 keV.

Sheet resistance measurement. The graphene was transferred onto a silicon wafer 
using a PMMA-supported layer as previously described. For non-destructive tests, 
ultraviolet-treated graphene/Cu was transferred to a SiO₂ substrate. The resistance
was measured using the four-point probe method and a Keithley 2000
Multimeter. 

Density functional theory (DFT) calculations. We performed density functional
theory calculations within the generalized gradient approximation as implemented
in the DMol3 code. All-electron Kohn-Sham wavefunctions were expanded in a local
atomic orbital basis set with each basis function defined numerically on an
atom-centred spherical mesh. Double numeric polarized basis sets
were used for all elements. We used a slab geometry of 8 × 8 repeating graphene
units in the x- and y-directions, containing 128 carbon atoms, and applied periodic
boundary conditions in three dimensions. After cell relaxation, the supercell size
was 19.68 Å × 19.68 Å, and 10 Å of vacuum in the z-direction was used. The
maximum force allowed during geometry optimization was 0.1 eV Å⁻¹, the maximum
displacement was 0.005 Å and the total energy change was 5 × 10⁻⁴ eV. The
damped atom-pairwise dispersion corrections of the form C₆R⁻⁶ were also
considered for calculations. A gamma-point irreducible Monkhorst-Pack k-point
grid sampling was used for structural relaxations. The energy convergence
was checked using a more refined 4 × 4 × 1 k-point sampling, and the energy
difference was less than 1 meV per atom. Then a Stone-Wales (SW) defect was 
generated and the geometry was again optimized within the criteria mentioned 
above. The formation energy of an SW defect was 5.13 eV. For the geometries with 
adatoms, we added an adatom onto the graphene surface with an SW defect at an 
intended position and then the geometry was optimized repeatedly for every 
adsorption point. 
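
As a quick consistency check on the supercell described above, the following Python sketch recomputes the atom count and the implied lattice constant. It assumes a hexagonal in-plane cell (60° between the two 19.68 Å edges), which the text does not state explicitly; the variable names are illustrative only.

```python
import math

ATOMS_PER_CELL = 2        # carbon atoms in one primitive graphene cell
REPEATS = (8, 8)          # in-plane repetitions quoted in the text
SUPERCELL_EDGE = 19.68    # relaxed in-plane edge length in angstrom (from the text)

n_atoms = ATOMS_PER_CELL * REPEATS[0] * REPEATS[1]
lattice_constant = SUPERCELL_EDGE / REPEATS[0]

# C-C bond length of an ideal honeycomb lattice is a / sqrt(3).
bond_length = lattice_constant / math.sqrt(3.0)

# Assumption: hexagonal cell, so the area is edge^2 * sin(60 degrees).
area = SUPERCELL_EDGE ** 2 * math.sin(math.radians(60.0))
density = n_atoms / area  # atoms per square angstrom
```

The count reproduces the 128 atoms quoted in the text, and the implied lattice constant of 2.46 Å corresponds to a C-C bond length of about 1.42 Å.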

From fully optimized geometries of reactants and products, we considered 
diffusion pathways of the penetration of H and O atoms or an OH molecule. In 
order to find the diffusion barriers, transition state search routines of the linear 
synchronous transit (LST) method and the quadratic synchronous transit (QST)
method were used. LST started with a single LST maximization, bracketing the
maximum between the reactants and product, and was then followed by an energy 
minimization in directions conjugate to the reaction pathway. This yielded a 
structure lower in energy and closer to the true transition state than a simple 
LST method. Minimization continued until an energy minimum was reached or 
the number of conjugate directions was exhausted. The LST approximation 
obtained in that way was used to perform a QST maximization. The QST method 
interpolated the reaction pathway among three structures; an intermediate 
geometry was required in addition to the reactant and product structures. From 
the QST maximized point, another conjugate gradient (CG) minimization was 
performed. The cycle was repeated until a stationary point was located or the 
number of allowed QST steps was exhausted. By repeatedly applying the LST-CG and then
the QST-CG routines, the transition state was located.
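
The first LST step followed by a minimization can be illustrated on a toy problem. The Python sketch below is not the DMol3 implementation: it uses a hypothetical two-dimensional energy surface, interpolates linearly in Cartesian coordinates, and replaces the conjugate-gradient minimization by a one-dimensional search perpendicular to the path. It only shows why the refined point is lower in energy, and closer to the saddle, than the plain LST maximum.

```python
def potential(x, y):
    """Toy energy surface: minima at (-1, 0) and (1, 0), saddle at (0, 0.5)."""
    return (x * x - 1.0) ** 2 + 2.0 * (y - 0.5 * (1.0 - x * x)) ** 2


def golden_max(f, lo, hi, tol=1e-10):
    """Golden-section search for the maximum of a unimodal f on [lo, hi]."""
    inv_phi = (5 ** 0.5 - 1) / 2
    a, b = lo, hi
    while b - a > tol:
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
        if f(c) > f(d):
            b = d   # the maximum lies in [a, d]
        else:
            a = c   # the maximum lies in [c, b]
    return 0.5 * (a + b)


REACTANT, PRODUCT = (-1.0, 0.0), (1.0, 0.0)


def on_path(t):
    """Point at fraction t along the straight line from reactant to product."""
    return (REACTANT[0] + t * (PRODUCT[0] - REACTANT[0]),
            REACTANT[1] + t * (PRODUCT[1] - REACTANT[1]))


# Step 1: maximize the energy along the linear path (the LST maximum).
t_max = golden_max(lambda t: potential(*on_path(t)), 0.0, 1.0)
x_lst, y_lst = on_path(t_max)
e_lst = potential(x_lst, y_lst)

# Step 2: minimize perpendicular to the path (here the y direction).
y_ref = golden_max(lambda y: -potential(x_lst, y), -2.0, 2.0)
e_refined = potential(x_lst, y_ref)
```

On this surface the refined point coincides with the true saddle; on a real potential energy surface the QST cycles described above would continue from this estimate.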

In order to check the Cu substrate effect, we calculated the diffusion barrier 
height through OH-functionalized SW defects in the presence of copper substrate. 
This was done by placing a Cu(111) surface below the graphene layer. We first 
optimized the Cu bulk system using a PBE functional with 12 × 12 × 12
Monkhorst-Pack k-point grid sampling and pseudopotential basis sets.
From the optimized bulk Cu, we generated two layers of Cu(111) surface with
8 × 8 surface units. We included a sizeable vacuum region in order to add
graphene with admolecules for further calculations. The supercell size was
19.68 × 19.68 × 20.00 Å, the same as that of graphene in the in-plane directions. The
Cu(111) surface was under a compressive strain of 4.8%, owing to the lattice mismatch
between the Cu(111) surface and graphene. The graphene layer was placed 3.2 Å
above the Cu layer. The electron density for transition states obtained without a Cu 
layer was optimized again in the presence of a Cu surface without relaxing ions. 
The total energy difference between the transition state and the reference state 
geometries has been defined as the diffusion barrier for each species diffusing. 

April 2012 intra-oceanic seismicity off Sumatra
boosted by the Banda-Aceh megathrust 


Matthias Delescluse¹, Nicolas Chamot-Rooke¹, Rodolphe Cattin², Luce Fleitout¹, Olga Trubienko¹ & Christophe Vigny¹


Large earthquakes nucleate at tectonic plate boundaries, and their 
occurrence within a plate’s interior remains rare and poorly docu- 
mented, especially offshore. The two large earthquakes that struck 
the northeastern Indian Ocean on 11 April 2012 are an exception: 
they are the largest strike-slip events reported in historical times
and triggered large aftershocks worldwide. Yet they occurred
within an intra-oceanic setting along the fossil fabric of the extinct
Wharton basin, rather than on a discrete plate boundary. Here we
show that the 11 April 2012 twin earthquakes are part of a continu- 
ing boost of the intraplate deformation between India and Australia 
that followed the Aceh 2004 and Nias 2005 megathrust earthquakes, 
subsequent to a stress transfer process recognized at other subduction
zones. Using Coulomb stress change calculations, we
show that the coseismic slips of the Aceh and Nias earthquakes 
can promote oceanic left-lateral strike-slip earthquakes on pre- 
existing meridian-aligned fault planes. We further show that per- 
sistent viscous relaxation in the asthenospheric mantle several years 
after the Aceh megathrust explains the time lag between the 2004 
megathrust and the 2012 intraplate events. On a short timescale, 
the 2012 events provide new evidence for the interplay between 
megathrusts at the subduction interface and intraplate deformation 
offshore. On a longer geological timescale, the Australian plate, 
driven by slab-pull forces at the Sunda trench, is detaching from 
the Indian plate, which is subjected to resisting forces at the 
Himalayan front.

The first and largest shock of 11 April 2012 (moment magnitude
Mw 8.6) occurred 330 km west-southwest of the Aceh 2004 epicentre,
but only 120 km southwest of the subduction front. It was followed 2 h
later on the same day by an Mw 8.2 twin earthquake, located some
180 km south of the main event. The oceanic fabric there is characterized
by a set of more or less north-south-oriented fracture zones, well
captured by satellite-derived gravity measurements (Fig. 1) and
mapped in situ using multi-beam bathymetry. These are fossil transform
faults that were active during the opening of the Wharton basin,
which shut off some 45 Myr ago. Recent reactivation of these faults was
confirmed by seismic reflection profiling showing basement and sediment
offsets as well as sea-floor expression. The epicentres of the two
11 April 2012 events align along a 022° direction, which is within a few 
degrees of one of the nodal planes for both earthquakes. 

The twin shocks thus seem to have involved slip on one of these
very long fossil fracture zones that can be followed for several hundred
kilometres. The perpendicular normal fault orientation offers only
limited continuous fault length to the rupture as it is offset by fracture
zones. However, we cannot rule out slip on the conjugate plane, as
such slip happened further south in the Wharton basin in the year
2000. For the Mw 8.6 event, aftershocks (Fig. 1) and back-projection
analysis indicate the activation of N110°E planes. The entire
oceanic fabric, normal faults included, seems to be somewhat reacti- 
vated by these complex events. 

Figure 1 | Present-day kinematics of the India-Australia plate. The 11 April
2012 Mw 8.6 and Mw 8.2 earthquakes, shown here by their blue and white focal
mechanisms ('beachballs'), occurred in the Wharton basin (WB), at the heart of
the diffuse plate boundary between India and Australia. Ninetyeast Ridge
(NYR), which is the Kerguelen hotspot trail, separates the Wharton basin from
the central Indian basin (CIB). Off trench, the background is the 120 km
high-pass-filtered satellite free-air gravity anomaly, illuminating meridian-aligned
fracture zones. Three convergence vectors are displayed east of the Sumatra-Andaman
trench, calculated at the location of the 2004 Aceh epicentre. The
blue vector is the India/Sunda vector (IN/SU) predicted by the MORVEL global
model (46 mm yr⁻¹ towards N22°E), while the green vector is the Australia/Sunda
prediction using the same model (54 mm yr⁻¹ towards N8°E). The true
convergence vector is in between, since the subducting plate is neither India nor
Australia. The black vector (56 mm yr⁻¹ towards N20°E) is an estimation of the
convergence rate that takes into account a non-rigid India-Australia plate. This
non-rigid crustal velocity field is shown as red vectors with their 95% error
ellipses at knot points west of the trench line. The corresponding strain-rate
field is shown as green double arrows. At the latitude of the April 2012
earthquake sequence, motion of the Wharton basin with respect to India
progressively increases west to east, from negligible close to Ninetyeast ridge to
10 mm yr⁻¹ close to the trench.

The 11 April 2012 earthquake sequence is located in an area previously
recognized as part of the diffuse deformation zone between the
Indian and Australian plates. Intraplate deformation is active on
both sides of Ninetyeast Ridge, predominantly as reverse faults with
north-south compressive axes (P-axes) west of it and strike-slip
faults with northwest-southeast P-axes east of it. Large-scale folding
of the oceanic lithosphere was found both in the equatorial Indian
Ocean and in the Wharton basin, and interpreted as a direct consequence
of large in-plate stresses. The present-day kinematics of
the Indian and Australian plates is now well established, based on
Global Positioning System (GPS) measurements away from the deforming
zone. Following the approach of Holt and Haines (Methods), we
have derived a self-consistent non-rigid velocity field (Fig. 1) that
combines far-field boundary conditions (GPS over Australia and 
India, plus a few islands within the deforming zone) and local con- 
straints from the deformation zone itself (style of deformation from 
focal mechanisms and heat flow as a proxy for strength). Offshore from 
Sumatra, the motion exhibits a gradual south-north evolution from the 
Australian plate velocity to the Indian plate velocity. The predicted 
amount of shear across the set of meridian-aligned faults is of the order
of 10 mm yr⁻¹.
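
For orientation, the two rigid-plate convergence vectors quoted in the Fig. 1 caption can be differenced to give the motion of a rigid Australian plate relative to a rigid Indian plate at the Aceh epicentre. The Python sketch below is only vector arithmetic on the caption values; it is not the non-rigid velocity field derived in the paper, and the function names are illustrative.

```python
import math


def to_east_north(speed, azimuth_deg):
    """Convert a speed and an azimuth (degrees clockwise from north) to (east, north)."""
    az = math.radians(azimuth_deg)
    return speed * math.sin(az), speed * math.cos(az)


def to_speed_azimuth(east, north):
    """Inverse of to_east_north; the azimuth is returned in the range [0, 360)."""
    return math.hypot(east, north), math.degrees(math.atan2(east, north)) % 360.0


# Rigid-plate predictions quoted in the Fig. 1 caption (mm/yr, relative to Sunda).
india = to_east_north(46.0, 22.0)
australia = to_east_north(54.0, 8.0)

# Motion of rigid Australia relative to rigid India at the same point.
rel_east = australia[0] - india[0]
rel_north = australia[1] - india[1]
rel_speed, rel_azimuth = to_speed_azimuth(rel_east, rel_north)
```

The difference is a rigid-plate end member of roughly 14-15 mm yr⁻¹ directed to the northwest; the non-rigid model distributes the India-Australia motion across the deforming zone rather than concentrating it at one point.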

Intraplate seismicity has been boosted by the Aceh and Nias ruptures. 
We show in Fig. 2 a chrono-spatial chart of the seismicity of the 
Wharton basin, covering the 8 years before and the 8 years after the 
Aceh earthquake. An increase in the rate of seismicity immediately 
followed the Aceh and Nias earthquakes, especially for the portions 
of oceanic lithosphere experiencing peak slip rate on the failure plane 
of the Aceh megathrust. The Nias earthquake that followed the Aceh 
earthquake triggered oceanic earthquakes in a region that remained 
silent during the time lag of three months between the two events. 
We also estimated the net increase in intraplate seismic moment release.
On the basis of a century-long catalogue, the mean moment release for
the entire deforming Indian Ocean (including the central Indian basin
and the Wharton basin) was about 2.3 × 10¹⁹ N m yr⁻¹ before the Aceh
earthquake, two-thirds of it being released in the Wharton basin. In the
year 2005, this number jumped to 1.2 × 10²⁰ N m for the Wharton
basin only. For comparison, seismic moment release for the two 11
April earthquakes is above 1.4 × 10²² N m, which is about 1,000 times
the yearly release estimated from the century-long catalogue.
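
The orders of magnitude in this comparison can be checked with the standard moment-magnitude relation, Mw = (log₁₀ M₀ − 9.1)/1.5 with M₀ in N m. The Python sketch below assumes the exponents read as 10¹⁹ and 10²² for the background rate and the April 2012 release, and it compares the 2012 release with the Wharton basin share of the background rate, which is one possible reading of "yearly release". The catalogue moments used by the authors may differ from the values computed from magnitudes here.

```python
import math


def seismic_moment(mw):
    """Seismic moment in N m from moment magnitude (standard Mw definition)."""
    return 10.0 ** (1.5 * mw + 9.1)


def moment_magnitude(m0):
    """Moment magnitude from seismic moment in N m."""
    return (math.log10(m0) - 9.1) / 1.5


# Values taken from the text (exponents read as 19 and 22: assumption).
background_rate = 2.3e19                    # N m per year, whole deforming Indian Ocean
wharton_rate = background_rate * 2.0 / 3.0  # two-thirds released in the Wharton basin
april_2012_release = 1.4e22                 # N m, lower bound quoted for the two events

ratio = april_2012_release / wharton_rate
equivalent_mw_per_year = moment_magnitude(wharton_rate)
mainshock_moment = seismic_moment(8.6)
```

The ratio comes out at several hundred to about a thousand, consistent with the "about 1,000 times" statement, and the pre-2004 Wharton basin rate is equivalent to roughly one Mw 6.7 event per year.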

We compiled focal mechanisms of 47 earthquakes (Mw > 5) that
occurred off Sumatra in the oceanic plate from December 2004 (Aceh
earthquake) until 15 April 2012 (from the Global CMT catalogue).
We excluded earthquakes within the slab portions already engaged in
subduction, although many of them are clearly related to intraplate
deformation and reactivation of the oceanic fabric, such as the
Padang earthquake of 2009 (Mw 7.6, depth >80 km). The post-Aceh
oceanic earthquakes fall into two categories: normal faults at the external
wall of the trench (18 events), and earthquakes with a northwest-southeast
P-axis corresponding to the release of India-Australia intraplate
stress (29 earthquakes). Among the compressive events, only one
seems unrelated to the oceanic fabric, the other 28 being strike-slip
earthquakes with one left-lateral meridian-aligned nodal plane
(015° ± 005°). The very consistent orientation of the nodal planes over
a wide region is strong evidence for reactivation of the oceanic fabric in 
the area. 

To test the effect of the Aceh 2004 and Nias 2005 ruptures on the
reactivation of the Wharton basin oceanic fabric, we calculated the far-field
static Coulomb stress change ΔCFF = Δτ − µ′Δσₙ resulting from
the coseismic displacement on the subduction interface, up to 250 km
seaward of the trench axis. Here Δτ is the static shear stress change on
the failure planes (positive in the direction of fault slip), Δσₙ the static
normal stress change (positive if the fault is clamped) and µ′ the
effective friction coefficient (see Methods and ref. 23). A positive
Coulomb stress change favours rupture.
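
The resolution step of this calculation can be sketched in a few lines of Python. The sketch assumes that a horizontal stress-change tensor is already known at the receiver point (in the paper it comes from a dislocation model of the megathrust slip) and resolves it onto a vertical, left-lateral plane. The coordinate convention (x east, y north, tension positive) and the function name are choices made here for illustration; the default µ′ = 0.4 is the value stated in the Methods.

```python
import math


def coulomb_stress_change(d_sxx, d_syy, d_sxy, strike_deg, mu_eff=0.4):
    """Coulomb stress change on a vertical, left-lateral receiver fault.

    Stress changes are horizontal components (x = east, y = north) with
    tension positive; strike is measured clockwise from north.
    """
    phi = math.radians(strike_deg)
    sx, sy = math.sin(phi), math.cos(phi)     # unit vector along strike
    nx, ny = math.cos(phi), -math.sin(phi)    # unit normal, to the right of strike
    # Traction change on the plane, then its along-strike and normal components.
    tx = d_sxx * nx + d_sxy * ny
    ty = d_sxy * nx + d_syy * ny
    d_tau = tx * sx + ty * sy                 # positive promotes left-lateral slip
    d_sigma_n = -(tx * nx + ty * ny)          # positive when the fault is clamped
    return d_tau - mu_eff * d_sigma_n


# Hypothetical stress change (MPa) resolved on a N15E-oriented vertical plane.
example = coulomb_stress_change(0.02, -0.01, 0.03, 15.0)
```

A shear stress increase combined with a normal stress decrease (unclamping) gives a positive result for any non-negative µ′, which is the situation described for the area in front of the Aceh coseismic slip.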

Calculations were first performed to resolve stress onto a N15°E-oriented
vertical plane (Fig. 3a). The Coulomb stress change is calculated
at 18 km depth, which is the average depth of the earthquakes
before 2012 and an upper limit for the centroid depth of the mainshock
of 11 April 2012 (18-27 km; ref. 14). We find that all strike-slip earthquakes
occurred in a lobe of positive Coulomb stress change. The area
of highest Coulomb stress change in front of the 2004 Aceh coseismic
slip corresponds to a shear stress increase and to a normal stress
decrease. This situation favours left-lateral strike-slip on the N15°E
vertical faults independently of the value of the effective friction coefficient
(unclamping). We further tested the effect of the azimuth of
the vertical planes and the variation of the Coulomb stress change with
depth (Fig. 3b). The mainshock of 11 April 2012 lies at the exact limit
where N20°E planes are still positively favoured by the Aceh 2004 slip.
The Coulomb stress change is much higher if we consider that the Mw
8.6 event nucleated on a N110°E plane with dextral motion (see Supplementary
Information), as suggested by seismological studies.
Depth does not really affect the result for the considered 5-45 km
range. The 2005 Nias rupture also favours the occurrence of
earthquakes on N15°E-oriented vertical planes, but focal mechanisms
there indicate normal rather than strike-slip events. Most of the
normal faults oriented along the trench strike are located in areas of
positive Coulomb stress change for their respective receiver fault orientation
(see Supplementary Information). Slip on newly formed
thrust faults perpendicular to the intraplate P-axis is inhibited by
the Aceh earthquake coseismic stresses (see Supplementary Information).
Over the long term, only a few reverse faults were active during
the past century, as stresses apparently rarely reached the level
required to create a new fault.

Figure 2 | Oceanic intraplate seismicity boost after the Aceh (2004) and
Nias (2005) earthquakes. a, Map representation of the same intraplate
seismicity as b. To ensure the intraplate nature of the earthquakes, events
50 km off-trench are shown as larger dots. b, Chronological chart. Year 0
corresponds to 26 December 2004. Red (after December
2004) and blue (before December 2004) events are exclusively located over the
oceanic plate west of the trench line and up to 250 km away from the trench
line, whereas green points represent the entire seismicity, including the upper
plate. All earthquakes were extracted from the NEIC catalogue. Focal
mechanisms of post-Aceh events with magnitudes greater than 5 that occurred after
December 2004 are also displayed. Blue focal mechanisms correspond to the
April 2012 earthquake sequence, and black focal mechanisms to earthquakes
that occurred in the same area since the Aceh event.

Figure 3 | Static Coulomb stress change calculation offshore the Sumatra-Andaman
subduction. a, Each colour pixel represents a N15°E-oriented
vertical fault plane. Note that the slip colour scale is adapted to the Aceh
earthquake slip distribution, and requires division by a factor of three to obtain
the Nias earthquake slip values. b, The Coulomb stress change has been tested
for different depths and azimuths of the fault planes at three different locations,
from north to south: Z1 (30 June 2005 earthquake location), Z2 (1 January 2005
earthquake location) and Z3 (11 April 2012 mainshock location). White bars
bound the azimuths of the oceanic fabric. Slip on the oceanic fabric is best
favoured to the north of the area.

The Coulomb stress change distribution is highly sensitive to the 
coseismic slip distribution. The high sensitivity of the model to shallow 
slip, in particular, may partly be due to assumptions underlying the 
Coulomb model, such as a flat-plane subduction interface and a
homogeneous elastic half-space. Many coseismic solutions (see, for
example, refs 24-29) have been proposed for the Aceh and Nias earth- 
quakes based on the inversion of various data sets, including GPS 
displacements, coral reef observations, long-period teleseismic data, 
tide-gauge records and satellite altimetry measurements. We use here 
the slip distributions obtained in refs 24 and 25, for the Aceh and Nias 
earthquakes, respectively. First-order properties of the slip distribu- 
tions obtained in these two models are consistent with several other 
studies. Our best choice for slip distribution was however driven
by one significant observation: triggered intraplate earthquakes, 
both normal (see Supplementary Information) and strike-slip, occur 
extremely close to the toe of the trench. Our sensitivity tests showed 
that only slip distributions including shallow, near-trench displacement
successfully produce a positive Coulomb stress change close
enough to the trench. Most coseismic slip inversions are rather
insensitive to shallow displacements at the toe of the subduction,
which is generally too far from the land stations where measurements
are available. They thus tend to minimize the amount of shallow
rupture there. However, marine constraints (2011 Tohoku Mw 9.0
earthquake, Japan) and geodetic measurements close to the failure
plane (2010 Maule Mw 8.8 earthquake, Chile) have shown that the
coseismic displacement of giant earthquakes does reach the toe of the 
trench. 

A clear drawback of the Coulomb stress calculation is that it fails to 
model time-dependent stress transfers because the underlying rhe- 
ology is elastic. In these elastic models, promoted faults should break 
shortly after the megathrust event, as the extensional stress within the 
oceanic plate off the trench begins to decrease immediately after the 
earthquake with the start of a new interseismic loading cycle. Yet the 
time lag between Aceh and the 2012 earthquake sequence is more than 
7 years, which indicates another mechanism producing a northeast- 
southwest unclamping that is larger than the slow buildup of northeast- 
southwest compressive stress during interseismic loading. Transient 
triggering may be related to the viscoelastic properties of the astheno- 
sphere. In Supplementary Information, we present a simple two- 
dimensional viscoelastic model of the earthquake cycle that predicts 
the time evolution of the stresses linked to subduction earthquakes as a 
function of distance to the trench. The observed time lag between the 
Aceh and Nias earthquakes and the April 2012 earthquake sequence is 
found to be compatible with a rheology of the asthenosphere independently
required to explain near-field to far-field GPS constraints in the
Sunda block, on the other side of the trench. The maximum post-seismic
relaxation stress is reached between 7 and 10 years after the
megathrust in the area of the 2012 events. The further from the trench, 
the longer the time lag until maximum post-seismic relaxation stress is 
reached. Similar three-dimensional finite element viscoelastic calculations,
using a more realistic rupture zone geometry, further confirm
that at the locations of the two events of April 2012, the net stress effect 
is unclamping in the northeast direction, in agreement with the two- 
dimensional viscoelastic model and the Coulomb stress change calcula- 
tions presented here. 

The two large earthquakes of 11 April 2012 are typical of the intraplate 
deformation between the Indian and Australian plates, the fossil fabric of 
the oceanic lithosphere being re-activated across the entire Indian
Ocean. The mega-earthquakes of Aceh and Nias acted as instantaneous
boosters, suddenly illuminating where and how intraplate
deformation was at work. Viscous relaxation in the asthenospheric
mantle is responsible for short-term stress building, still contributing
several years to several tens of years after the megathrust events. Long-term
stress building there is the direct consequence of the high mechanical
coupling of the Indian plate to the Eurasian plate at the Himalayan
front that started some 8 Myr ago. Dense oceanic slabs engaged at the
Sunda trench inexorably drive the Australian plate northward, while this 
motion is resisted at the Himalayas. The long-term scenario is that a 
nascent plate tectonic boundary is forming: the Australian plate is 
becoming detached from the Indian plate. 

METHODS SUMMARY


Kinematic modelling. The kinematic model is detailed elsewhere. It includes
constraints from far-field GPS measurements and style of deformation from a 
century-long intraplate earthquake catalogue. 

Coulomb stress change calculations. Coseismic stresses are computed using 
megathrust slip distributions described as displacement discontinuities in a
homogeneous elastic half-space. The shear and normal stresses are then
extracted for a chosen plane geometry that is supposed to pre-exist in the entire 
oceanic lithosphere. 

Post-seismic relaxation. A two-dimensional finite element model featuring vari- 
able rheological parameters is used (Z-set/Zébulon; see Methods). The rheological 
parameters fit the upper plate GPS constraints. 


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS


Intraplate kinematics using the Haines and Holt method. The Haines and
Holt method derives continuous velocity and strain-rate fields by interpolating
modelled velocities that are fitted in a least-squares sense to GPS velocities. A
3° × 3° grid was defined to cover the entire India-Australia plate. The cells located
within the rigid Australia plate were not allowed to deform, as opposed to the 
equatorial Indian Ocean and the Indian continent. Cells are allowed to deform 
through an anisotropic strain-rate variance. Focal mechanisms are used to define 
the anisotropic variance so that the direction of the strain-rate field is controlled by 
the principal axis of deformations from observed seismic moment tensors. Notice 
that only the directions are used, as the magnitude and sign of strain-rate along the 
principal axis are not an input to the model. The general level of variance in cells is 
also a proxy for the rheology of the lithosphere. Non-uniform variance allows 
localization of deformation where the variance is high. In our model, heat flow is 
used as the proxy for the rheology of the lithosphere. As a result, cells including 
high heat-flow measurements are allowed to deform more. Details of the model 
and the earthquake catalogue can be found elsewhere.

Coulomb stress change calculations. We describe the two rupture zones of the 
2004 and 2005 megathrust events by surfaces of displacement discontinuities in 
an isotropic homogeneous elastic half-space. Details of the properties of slip distributions
can be found elsewhere. Each dislocation induces a three-dimensional
stress change field, which is calculated from the analytical solution of ref. 35 using a 
Poisson ratio v = 0.25 and a Young’s modulus E = 75 GPa. The Coulomb stress 
change is calculated on specified oriented vertical planes. To study the effect of the 
considered nodal planes, we assume two different receiver-fault orientations: 
N15°E and N110°E. 

The effective friction coefficient µ′ depends on the friction coefficient µ and
pore-fluid migration properties related to the Skempton coefficient B on the failure
plane. Laboratory experiments typically find values for µ of around 0.6 to 0.85 for
most rock material, apart from those rich in clay minerals. The Skempton coefficient
is a less well-known parameter ranging between 0.4 and 0.9 for granite,
sandstone and marble, but still unconstrained for other rocks. Values of µ′
between 0 and 0.75 are commonly assumed. When µ′ is high, the pore pressure
does not strongly affect the normal stress. At the other extreme, when µ′ = 0, the
rock is so saturated that the pore pressure annihilates the effect of the normal stress
on the plane. Following the commonly used values and our previous studies,
here we assume an effective friction coefficient of 0.4.

The threshold for a Coulomb stress increase that affects seismicity is still a 
matter of debate. Some studies propose a threshold of 0.01 MPa. Here colour
scales are saturated for a Coulomb stress change of 0.1 MPa.

Post-seismic relaxation. A two-dimensional finite element model featuring variable
rheological parameters is used (Z-set/Zébulon). The model considers not a
single earthquake but a seismic cycle with repeating periodic earthquakes, with a
period chosen equal to 170 yr.

An earthquake is modelled as a 10-m sudden displacement over the subduction 
interface. The thickness of the elastic lithosphere is chosen equal to 50 km. This 
value is intermediate between the thermal thickness and the flexural elastic thick- 
ness, appropriate for mimicking the lithosphere response for long timescales and 
large stress perturbations. The asthenosphere extends between depths of 50 and
170 km. The asthenosphere deforms either as a Maxwell viscoelastic solid with a
viscosity of 3 × 10¹⁸ Pa s or as a Burgers body (Kelvin-Voigt with viscosity
η = 3 × 10¹⁸ Pa s, transient shear modulus µ_Kelvin = µ_elastic/3, and long-term viscosity
3 X 10’? Pas). The Kelvin-Voigt viscosity is used to model the short-term viscous 
response of the asthenosphere. The mantle below the asthenosphere has a viscosity 
of 10²¹ Pa s (of little impact on our results). Supplementary Fig. 3 only presents the
central part of the finite element mesh. The computation is performed for a 
domain that extends horizontally from −4,000 to +4,000 km from the trench
and from the Earth’s surface to 1,500 km depth. 
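
To give a feel for the timescale implied by the Maxwell rheology, the relaxation time η/µ can be estimated as in the Python sketch below. Two assumptions are made loudly here: the viscosity exponent is read as 10¹⁸ Pa s, and the shear modulus is derived from the elastic constants quoted for the Coulomb half-space (E = 75 GPa, ν = 0.25), which the text does not say are the ones used in the finite element model.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 3600


def shear_modulus(young, poisson):
    """Shear modulus of an isotropic elastic solid from E and nu."""
    return young / (2.0 * (1.0 + poisson))


def maxwell_time_years(viscosity, modulus):
    """Maxwell relaxation time eta / mu, converted from seconds to years."""
    return viscosity / modulus / SECONDS_PER_YEAR


mu = shear_modulus(75e9, 0.25)       # Pa; half-space constants (assumption)
tau = maxwell_time_years(3e18, mu)   # yr; viscosity exponent read as 18 (assumption)
```

Under these assumptions the Maxwell time is a few years. This is only a material timescale; the 7 to 10 year delay discussed in the main text comes from the full model geometry and cannot be read off this single number.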

En echelon and orthogonal fault ruptures of the
11 April 2012 great intraplate earthquakes 


Han Yue¹, Thorne Lay¹ & Keith D. Koper²


The Indo-Australian plate is undergoing distributed internal
deformation caused by the lateral transition along its northern
boundary, from an environment of continental collision to an
island arc subduction zone. On 11 April 2012, one of the largest
strike-slip earthquakes ever recorded (seismic moment magnitude
Mw 8.7) occurred about 100-200 kilometres southwest of the
Sumatra subduction zone. Occurrence of great intraplate strike-slip 
faulting located seaward of a subduction zone is unusual. It results 
from northwest-southeast compression within the plate caused by 
the India-Eurasia continental collision to the northwest, together 
with northeast-southwest extension associated with slab pull stresses 
as the plate underthrusts Sumatra to the northeast. Here we use 
seismic wave analyses to reveal that the 11 April 2012 event had an 
extraordinarily complex four-fault rupture lasting about 160 seconds, 
and was followed approximately two hours later by a great 
(Mw 8.2) aftershock. The mainshock rupture initially expanded
bilaterally with large slip (20-30 metres) on a right-lateral strike- 
slip fault trending west-northwest to east-southeast (WNW-ESE), 
and then bilateral rupture was triggered on an orthogonal left- 
lateral strike-slip fault trending north-northeast to south-southwest 
(NNE-SSW) that crosses the first fault. This was followed by west- 
ward rupture on a second WNW-ESE strike-slip fault offset about 
150 kilometres towards the southwest from the first fault. Finally, 
rupture was triggered on another en échelon WNW-ESE fault 
about 330 kilometres west of the epicentre crossing the Ninetyeast 
ridge. The great aftershock, with an epicentre located 185 kilometres 
to the SSW of the mainshock epicentre, ruptured bilaterally on a 
NNE-SSW fault. The complex faulting limits our resolution of the 
slip distribution. These great ruptures on a lattice of strike-slip faults 
that extend through the crust and a further 30-40 kilometres into the 
upper mantle represent large lithospheric deformation that may 
eventually lead to a localized boundary between the Indian and 
Australian plates. 

It has long been recognized from relative plate motions that the 
Indo-Australian plate is not behaving as a single rigid unit.
Earthquake focal mechanisms, plate spreading rates inferred from 
magnetic lineations, fracture zone orientations, seismic stratigraphy, 
folds and sedimentary unconformities, and geodetic observations indi- 
cate diffuse internal deformation of the plate over a broad equatorial 
region (Fig. 1). This region extends from the central Indian ridge near 
the Chagos bank, eastward past the Ninetyeast ridge to the Sumatra 
trench, southward along the Ninetyeast ridge, and southeastward
throughout the Wharton basin. The southwestern part of the plate
appears to have already fragmented to produce the Capricorn subplate,
which has a diffuse border with the Australian plate along the
southern Ninetyeast ridge. The NNE-SSW trends of the Ninetyeast 
ridge and fracture zones in the Wharton basin, together with aligned 
left-lateral strike-slip faulting mechanisms in both areas, may lead one 
to anticipate a similar orientation for any great rupture in the intra- 
plate deformation zone. For example, the large 18 June 2000 (13.87° S, 
97.3° E; Mw 7.9) earthquake in the Wharton basin appears to have
involved predominantly left-lateral strike-slip faulting along the
expected NNE-SSW orientation, although a second fault orientation
was also activated during the 34-s-long rupture. However, the east-west
trend of the equatorial deformation zone raises the possibility of
right-lateral faulting further north. 

On 11 April 2012, a great intraplate earthquake (Mw 8.7) initiated at
2.31° N, 93.06° E at 08:38:37 UTC, followed by a great aftershock
(Mw 8.2) at 0.77° N, 92.45° E at 10:43:09 UTC (Fig. 1). The overall
faulting geometries of both events inferred from point-source moment 
Figure 1 | The 11 April 2012 rupture sequence. Top inset, the regional plate
tectonic setting, with the Indo-Australian plate being segmented into three 
subplates—India (IND), Australian (AUS) and Capricorn (CAP); also shown 
are the adjacent Somalian (SOM), Antarctic (ANT) and Sunda (SUN) plates. 
The light grey zone is a region of intraplate deformation between the subplates. 
Black arrows indicate directions of intraplate compression and white arrows 
show extension from subduction. Major bathymetric features such as the Chagos
bank, Ninetyeast ridge and Wharton basin are indicated. Main panel, the USGS 
W-phase moment tensor solutions for the great events of 11 April 2012 
(beachballs), USGS one-week aftershock locations (red circles with magnitude 
scale at lower left), and location and focal mechanism for the 10 January 2012 
(Mw 7.2) event used as empirical Green functions in the surface wave analysis.
The barbed line indicates the Sumatra trench, and the rupture zone of the Mw
9.2 Sumatra-Andaman underthrusting event is shown. Bathymetry is shown,
with the colour scale at the lower right. White arrows indicate the direction of 
motion of the Indo-Australian composite plate relative to the Sunda plate. 
tensor representations are similar (Fig. 1), with steeply dipping strike-
slip orientations having either left-lateral slip on NNE-SSW faults or 
right-lateral slip on WNW-ESE faults, either of which would be con- 
sistent with the pervasive northwest-southeast compressional stress 
orientation throughout the region. The first event is possibly the
largest strike-slip earthquake ever seismically recorded (the 15
August 1950 Assam (Mw 8.6) interplate earthquake in the eastern
Himalayas is of comparable size, but its faulting mechanism is
uncertain), and it is probably the largest intraplate earthquake ever
seismically recorded. This event triggered increases in seismicity
globally. The 2012 events are located 100-200 km seaward of the
Sumatra subduction zone in which the Indo-Australian plate is 
underthrusting the Sunda plate, offshore of the epicentral region of 
the 26 December 2004 Sumatra-Andaman (Mw 9.2) interplate earthquake
rupture (Fig. 1). The 2004 event involved 15-30 m of trench-perpendicular
coseismic displacement on the plate boundary, and
increased northeast horizontal extensional stress in the 2012 source
region. The ocean lithosphere varies in age from about 45 to 65
million years from east to west across the source region, although 
thermal rejuvenation along the Ninetyeast ridge may reduce the effec- 
tive age there by up to 20 million years (ref. 19). 

The relative epicentral locations of the two great events on 11 April 
2012 immediately suggest rupture on a NNE-SSW-striking fault, and 
numerous rapid analyses of the seismic waves performed to characterize 
the space-time faulting history made this assumption; however, as more 
aftershock locations were determined, clear trends along parallel 
WNW-ESE lineations offset in latitude by ~150 km soon became
apparent (Fig. 1). A robust seismic method for identifying fault planes
and rupture spatial extent is to stack short-period P-wave signals
from networks of seismic stations at teleseismic distances, corrected
for propagation from a grid of possible source locations. Within
a few hours of the events, such semi-automated back-projections of 
Figure 2 | Short-period seismic energy release pattern. a, Locations of
seismograph stations in Europe that were used for the 0.5-2.0 Hz back-
projection of the Mw 8.7 and Mw 8.2 Sumatra earthquakes. Signals were selected
for high P-wave similarity (P-wave signal correlation coefficients relative to the 
array average are indicated) and a broad geographical distribution. b, Local 
beam power maxima during the back-projections, with colour indicating time 
after the nominal origin time. Symbol size is proportional to beam power. The 
dashed line is the location of the Sumatra trench. White stars are the epicentral 
short-period P waves from the mainshock recorded by networks of
stations in Europe, China, Japan and Australia showed that pulses of
coherent short-period seismic energy release appeared to illuminate
both WNW aftershock trends, as well as additional loci of seismic
radiation that were clearly not on a single NNE-SSW-oriented fault.

A summary of our own back-projections of short-period teleseismic 
P waves recorded by stations around Europe is presented in Fig. 2 
(a time-varying animation is provided in Supplementary Movie 1). 
Coherent sources of short-period seismic energy radiation are imaged 
for more than 160 s and display a close correspondence with the
regional distribution of epicentres of early (first-week) aftershocks,
which are clearly on multiple faults. The initial rupture on the northeastern
WNW-ESE fault appears to expand bilaterally, with stronger
radiation in the western part of the fault. After about 50 s the WNW
rupture propagation ends, with a total fault length of about 150 km and
a low overall rupture velocity of 1.5-2.0 km s⁻¹. Around 40 s into the
rupture process, seismic radiation begins to be emitted from a perpendicular,
presumably left-lateral conjugate plane that first ruptures
50-100 km in the SSW direction (from 30 to 60 s), then 50-100 km
in the NNE direction (from 55 to 90 s). After about 70 s, seismic
radiation continues on a second WNW-ESE fault that is roughly
parallel to the first and separated by ~150 km to the southwest. This
rupture propagates to the WNW, perhaps discontinuously, until about
145 s, at which time seismic radiation is apparent on a fourth distinct
fault further to the west and persists until ~160 s. Also shown in Fig. 2
are back-projection results for the Mw 8.2 aftershock that commences
just over 2 h later. The region of short-period energy release is much
more compact than that of the mainshock, and the duration is roughly
half as long (~80-85 s). The short-period energy release indicates
bilateral rupture on a NNE-SSW plane with dominant propagation 
initially from the epicentre towards the NNE, consistent with the loca- 
tions of early aftershocks, and weaker late energy release to the SSW. 
locations of the Mw 8.7 and 8.2 earthquakes. c, Normalized time integrated
relative beam power in units of cm’s 7 (colour scale at right ranges linearly 
from white for zero to purple for unity) for the Mw 8.7 back-projection. The
circles are early NEIC aftershocks with symbol size proportional to magnitude, 
and the dashed line is the Sumatra trench. The inset shows peak beam power as 
a function of time. d, Same as c but for the Mw 8.2 aftershock. The time-
dependent behaviour is best evaluated by viewing Supplementary Movies 1 and 2. 
Similar patterns of short-period radiation are observed for arrays of
stations in Japan (see Supplementary Fig. 1 and Supplementary Movies
1 and 2), and by other investigators who have posted preliminary
solutions online. Using a lower-frequency passband of 0.1-0.5 Hz
for the European P-wave observations smears the energy more broadly 
in space and accentuates the late energy in the mainshock rupture that 
occurs near longitudes of 89.5°-90.5° E, but otherwise gives results very 
similar to those in Fig. 2 (Supplementary Movies 1 and 2). 

An important limitation of these and other short-period back- 
projections is that they are not directly sensitive to the magnitude of fault
slip (short-period seismic radiation is sensitive to slip-rate variations,
and back-projection methods depend critically on wave field coherency,
which can be high for spatially concentrated radiation from a
small event and low for diffuse radiation from a large event).
Thus, whereas the short-period results indicate multiple potential acti- 
vated faults, the relative seismic moments of the fault segments are not 
resolved. 

To overcome this deficiency, we used broadband surface waves to
image spatial variations in seismic moment release, again allowing for
the possibility of multiple fault segments. To account for complex
surface wave propagation effects, we used global seismic recordings of
short-arc Rayleigh waves (R1) and Love waves (G1) for a moderate-size
(Mw 7.2) event on 10 January 2012 with a source location (2.45° N,
93.21° E; 18:36:59 UTC) about 20 km northeast of the mainshock
epicentre (Fig. 1), and a similar strike-slip focal mechanism. These
signals are deconvolved from the corresponding Mw 8.7 and 8.2
great-event recordings to produce time series called source time functions
(STFs). The large earthquake rupture properties can be inferred
by projecting the STFs into a space-time history of seismic radiation in
essentially the same way as is done in short-period back-projections. 

The spatial distributions of long-period seismic wave radiation 
imaged over a gridded region around the Mw 8.7 and 8.2 earthquake
epicentres are shown in Fig. 3. Darker colours indicate stronger long- 
period source radiation, which tracks the aftershock distribution. 
There is some intrinsic smearing of the images due to the non-uniform 
and incomplete azimuthal coverage, and the spatial resolution is lower 
than for the short-period back-projections because we are dealing with 
one-sided moment rate functions and lower frequency signals; 
however, this method provides a better indication of seismic moment 
distribution. Plots of this imaging with R1 and G1 separately are shown
in Supplementary Fig. 2. 

For the Mw 8.7 event, the largest long-period source energy is concentrated
near the epicentre, but there is significant source energy
along both ESE-WNW aftershock trends, indicating either continuous
rupture on corresponding faults or a sequence of discrete
coseismically triggered ‘early’ aftershocks. The secondary features have
peak amplitudes of 20-30% of the largest features, indicative of
significant relative seismic moment. Snapshots of the reconstructed
long-period radiation as a function of time are shown in Supplementary
Fig. 3; these reveal rupture propagation in the WNW direction on
both faults, and show that the concentration of radiation almost 400 km west
of the epicentre occurs at about 120 s. The relative STFs for the Mw 8.2
aftershock were similarly processed (Supplementary Fig. 4), and indi- 
cate source radiation concentrated near the hypocentre with rupture 
propagation towards the NNE. Comparison with Fig. 2 shows good 
agreement in map locations of source radiation for ~1 s period energy 
Figure 3 | Long-period seismic energy release pattern. a, b, Maps showing
the spatial patterns of long-period surface wave energy sources for the 11 April
2012 (Mw 8.7) mainshock (a; epicentre indicated with a red outlined star) and
the large (Mw 8.2) aftershock 2 h later (b; epicentre indicated with a red outlined
star). The one-week aftershock distribution is shown by small circles with radii
proportional to seismic magnitude. The colour images indicate the peak
stacked energy at each position from combined R1 source time functions (STFs)
and G1 STFs back-projected relative to the epicentral location of the 10 January
2012 (Mw 7.2) event (orange stars) which was used for empirical Green
functions (EGFs). Solid lines indicate the orientations of likely subfaults that 
ruptured during each event. Snapshots that indicate the time history of energy 
release are shown in Supplementary Fig. 3. Coloured circles in the mainshock 
image indicate specific locations of energy release that produce arrivals 
considered in c, which shows the azimuthally binned and stacked R1 (red) and
G1 (blue) STFs plotted in time relative to the EGF epicentre. These represent
seismic moment as a function of time observed at different azimuths. The
coloured curves correspond to the coloured circles in a, showing the predicted
arrival times of energy from each location, with ‘S-curve’ patterns due to relative
propagation times towards different azimuths. The solid curves are for R1 waves
using a velocity of 4.0 km s⁻¹ and the dashed curves are for G1 waves using a
velocity of 4.5 km s⁻¹. The features aligned on such curves form the
corresponding images in a, although images formed after 60 s of delay from the
STF onsets have the first 60 s of the STFs masked out to avoid contamination. A
corresponding profile of STFs for the Mw 8.2 aftershock is shown in
Supplementary Fig. 4. 
and ~20-200 s period energy, bolstering the interpretation that multiple
faults with significant slip were activated during the main rupture.
Neither imaging method provides depth resolution. 

To resolve the spatial slip distribution, broadband teleseismic P and 
SH waves were inverted together with the surface wave STFs for finite- 
faulting models using multiple distinct fault segments inferred from 
the back-projections of short- and long-period seismic energy. 
Although single-fault inversions allow basic waveform features in 
the first 60 s to be fairly well modelled, the overall waveforms can be
fitted better with additional fault complexity (and more parameters). 
Finite-fault inversions require specification of substantial a priori 
information about faulting geometry, rupture initiation time, rupture 
velocity, and discretization of the fault surface and subfault STFs. We 
use the consistency of the short-period and long-period imaging 
approaches in conjunction with the aftershock trends to specify four 
primary subfaults for the mainshock and one segment for the largest 
aftershock, with locations and timing of rupture initiation guided by 
the high-resolution short-period back-projections. Dip δ and rake λ
orientations for each subfault with strike φ were constrained by extensive
modelling and inversion suites.

Figure 4 shows the fault model geometry and the slip distributions 
obtained by least-squares inversion of P and SH waves using up to 
200-s-long time windows of the seismograms (with some being 
truncated early to avoid contamination from PP and SS phases) along 
with the R1 and G1 STFs. Guided by the short-period back-projections,
we specified the rupture velocity as 2.0 km s⁻¹ on all segments. There is
uncertainty in the kinematic parameters, but the collective informa- 
tion from aftershock locations and P wave and surface wave back- 
projections does provide first-order constraints. 

Our results show that the great events of 11 April 2012 involve 
rupture of a very complex network of faults, for which we have no 
documented precedent in recorded seismic history. Good fits are 
found for the large P and SH wave and R1 and G1 data set (see
Supplementary Fig. 5), although some secondary features are not fully 
accounted for. The failure process can be summarized as follows: first, 
a large moment release during a predominantly WNW-propagating, 
asymmetric bilateral strike-slip rupture (with 2.0 km s⁻¹ velocity) with
large peak slip (~37 m), about 150 km total rupture length and a
seismic moment corresponding to Mw ~8.5. Second, this rupture
triggered bilateral failure of a cross-cutting orthogonal fault that had
a seismic moment corresponding to Mw ~7.9. Third, subsequent rupture
occurred to the south on an en échelon ESE-WNW fault that
expanded in the WNW direction with a seismic moment corresponding to
Mw ~8.3. Fourth, the process culminated with a fourth activated fault
segment about 300 km west of the epicentre that ruptured on either
a WNW-ESE or NNE-SSW fault (or both) with Mw ~7.8 (Supplementary
Fig. 6 shows the inversion results assuming the NNE-SSW
orientation). The cumulative moment of these ruptures is
13.9 × 10²¹ N m, which gives Mw 8.7 (about 40% larger than point-source
seismic moment estimates, and 15% larger than a two-subevent
seismic moment estimate). A modest non-double couple component
is found when the individual subfault moment tensors are summed 
directly or allowing for temporal shifts (Supplementary Fig. 7). 
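
The conversion between cumulative seismic moment and moment magnitude quoted above can be checked with a short sketch. It assumes the standard moment magnitude definition, Mw = (2/3)(log10 M0 - 9.1) with M0 in N m, which the text does not state explicitly; the function names are illustrative only.

```python
import math


def moment_to_mw(m0_newton_metres: float) -> float:
    """Moment magnitude from scalar seismic moment in N m.

    Assumes the standard definition Mw = (2/3) * (log10(M0) - 9.1).
    """
    return (2.0 / 3.0) * (math.log10(m0_newton_metres) - 9.1)


def mw_to_moment(mw: float) -> float:
    """Inverse of moment_to_mw: scalar seismic moment in N m."""
    return 10.0 ** (1.5 * mw + 9.1)


# Cumulative moment quoted in the text for the four-segment model.
total_mw = moment_to_mw(13.9e21)

# Summing moments (not magnitudes) of the rounded subfault values from the text.
# Because the subfault magnitudes are rounded to one decimal, this sum is only
# expected to approximate the quoted cumulative moment.
subfault_mw = (8.5, 7.9, 8.3, 7.8)
summed_m0 = sum(mw_to_moment(m) for m in subfault_mw)
print(f"Mw from 13.9e21 N m: {total_mw:.2f}")
print(f"Mw from summed rounded subfault moments: {moment_to_mw(summed_m0):.2f}")
```

Moments add linearly whereas magnitudes do not, which is why the subfault contributions are converted to N m before summing.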

Even allowing for the trade-offs and non-uniqueness of the very 
complex model description, it is well-established that this event acti- 
vated a complex lattice of faults in the deformation zone between the 
India and Australia subplates, with the deep centroid depths, large 
estimated fault displacements and large extent of faulting of the 
sequence suggesting localization of deformation in the region extend- 
ing westward from Sumatra to the Ninetyeast ridge. The shortening 
between the India and Australia subplates that is being accommodated 
across this deformation zone is mainly being distributed onto strike- 
slip faults rather than thrust faults, and the ultimate configuration of 
the plate boundary that will develop is difficult to anticipate. The 
failure process is somewhat influenced by the plate fabric, with 
NNE-SSW-trending structures embedded in the plate from its earlier 
Figure 4 | Map of primary faulting during the Mw 8.7 event. a-f, Slip
distributions on each subfault; g, map showing locations of subfaults. Map
shows fault segments activated during the 11 April 2012 Mw 8.7 mainshock,
specified for finite-fault slip model inversion of teleseismic broadband P and
SH waves and R1 and G1 STFs. The faulting complexity is guided by the short-
period and long-period source imaging in Figs 2 and 3 (and associated 
animations and time snapshots in the Supplementary Information) along 
with the one-week aftershock distribution. The red star indicates the 
epicentral location, and red dots indicate the placement of hypocentres (all at 
30 km depth) on each fault segment. The rectangles indicate the subfault 
strikes and dips (shallow edge is on the green side, deeper edge on the black
side). We use four subfaults (two of the subfaults are further subdivided) with
onset times (T0) constrained by the short-period back-projections. Slip
distributions on each subfault (arrows indicate the relative size and direction
of slip, with slip size colour contoured using the colour scale at right; green
dots indicate the grid) are shown in a-f, with the subfault grids having 20 km
spacing along-strike and 10 km spacing along-dip. The peak slip and Mw of
each subfault are indicated, as is the position of the hypocentre on each subfault
from which the rupture expands. The rupture velocity is 2.0 km s⁻¹ for all
subfaults. The Mw 8.2 event (epicentre given by black star) is not inverted
because the broadband P and SH waves are obscured by surface waves from
the first event, but it appears to involve bilateral rupture on a fifth fault,
trending NNE-SSW. δ, dip; λ, rake; φ, strike.


formation probably providing zones of weakness that can fail in strike- 
slip events. However, the primary seismic moment is on WNW-ESE 
features, which cross-cut the trend of the Ninetyeast ridge. Large rup- 
ture through oceanic lithosphere cross-cutting fracture zones has been 
observed in the Antarctic plate. High-resolution bathymetry swaths
along the ridge have revealed many WNW-ESE fractures in the seafloor
before the 2012 sequence, and these young features appear to
have dominated the faulting in this great earthquake sequence. Many
more large fracturing events will be needed to evolve a localized plate 
boundary, so future extraordinary ruptures in the region may well 
occur, but this event currently stands as the largest-magnitude intra- 
plate strike-slip earthquake geophysicists have yet recorded. 


METHODS SUMMARY 


For a gridded distribution of possible rupture locations across the source region, 
short-period body waves and long-period surface waves from teleseismic stations 
were back-projected to image the space-time patterns of seismic wave radiation for 
the Mw 8.7 and 8.2 events, with minimal a priori assumptions about faulting
geometry. Narrow-band filtered P waves from large aperture arrays in Europe 
and Japan were used to image locations of short-period radiation on the source 
grid using predicted travel-time variations across the arrays relative to an initial 
hypocentral alignment by shifting and fourth-root summing of the signals. 
Broadband R1 Rayleigh waves and G1 Love waves from global seismic stations
were used to image the space-time locations of long-period seismic radiation. The
complex dispersion effects of surface wave propagation were first removed using
corresponding recordings from an Mw 7.2 earthquake on 10 January 2012 with
very similar location and focal mechanism to the Mw 8.7 mainshock as empirical
Green functions (EGFs). The EGF signals were deconvolved from the records of
the large events, eliminating long-range propagation effects and extracting relative
source time functions (STFs) for the large events. The STFs were then back-projected
onto the source grid using average phase velocities for R1 (4.0 km s⁻¹)
and for G1 (4.5 km s⁻¹). We then specified a set of four fault segments composed of
multiple 10-km-wide by 20-km-long subfaults that activate at prescribed times
(based on the short-period imaging), and inverted broadband teleseismic P and SH
waves and the surface wave STFs for finite fault slip distributions. The geometries
of the segments are constrained by aftershock distribution and focal mechanisms, 
back-projections, and suites of inversions with varying parameters. The final 
model has time-varying slip on each fault segment consistent with the back- 
projection imaging, aftershocks, and broadband teleseismic signals. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS


Short-period back-projection imaging. Back-projections of short-period 
teleseismic P waves were carried out for the Mw 8.7 and Mw 8.2 earthquakes of 11
April 2012 using large aperture arrays of seismometers drawn from various European 
broadband seismic networks and the F-net array of broadband seismometers in 
Japan. In each of the four cases, conventional time-domain back-projection was
used with a spatial grid bounded in longitude by 88°-96° E and in latitude by 
1°S-6°N, using 0.1° increments. The depth was held fixed at the nominal USGS 
hypocentral values of 23 km and 16 km, for the mainshock and aftershock, respect- 
ively. Stacked seismogram beam power was calculated using fourth-root stacking 
over a tapered, 10-s-long window that was shifted in increments of 1 s. Time shifts for 
back-projecting the seismic energy to the source region were calculated for the AK135 
reference model, and initial static station corrections were derived using multi-
channel cross-correlation (MCCC) of the initial 10 s of P-wave energy.
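
The shift-and-stack step described above can be illustrated with a minimal sketch of fourth-root stacking for one candidate grid node. This is not the processing code used for the Letter: the travel-time shifts are assumed to be precomputed integer sample offsets (in practice they would come from a reference model such as AK135 plus station statics), the traces are synthetic Gaussian pulses, and the function names are hypothetical.

```python
import numpy as np


def nth_root_stack(traces, shifts, n=4):
    """Nth-root stack of traces after removing per-station sample shifts.

    traces: array of shape (n_stations, n_samples).
    shifts: integer sample delays predicted for one candidate source node.
    """
    traces = np.asarray(traces, dtype=float)
    acc = np.zeros(traces.shape[1])
    for trace, shift in zip(traces, shifts):
        # np.roll wraps around the ends; acceptable here because the toy
        # traces are zero-padded well beyond the pulse.
        aligned = np.roll(trace, -int(shift))
        acc += np.sign(aligned) * np.abs(aligned) ** (1.0 / n)
    acc /= traces.shape[0]
    # Raise back to the nth power, preserving sign.
    return np.sign(acc) * np.abs(acc) ** n


def beam_power(traces, shifts, n=4):
    """Summed squared amplitude of the nth-root beam for one candidate node."""
    beam = nth_root_stack(traces, shifts, n)
    return float(np.sum(beam ** 2))


# Toy example: five stations record the same pulse with different delays.
t = np.arange(200)
true_delays = np.array([0, 3, 6, 9, 12])
traces = np.array([np.exp(-0.5 * ((t - 50 - d) / 2.0) ** 2) for d in true_delays])

# Two hypothetical grid nodes: one predicts the correct delays, one does not.
candidate_shifts = {"true node": true_delays, "wrong node": np.zeros(5, dtype=int)}
powers = {name: beam_power(traces, s) for name, s in candidate_shifts.items()}
best = max(powers, key=powers.get)
print(best, powers)
```

Repeating the calculation for every grid node and every position of a sliding time window yields a space-time image of beam power; the nonlinear fourth-root step suppresses contributions that are not coherent across the array, at the cost of distorting waveform amplitudes.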

For the European back-projection of the Mw 8.7 event, 196 vertical broadband
channels in the distance range of 43°-100° were downloaded from the ORFEUS 
Data Center (www.orfeus-eu.org). The data were examined for quality and ulti- 
mately 85 seismograms, each with a mean cross-correlation value >0.7 from an 
MCCC analysis of unfiltered traces, were selected for back-projection. The data 
were filtered into passbands of 0.5-2.0 Hz and 0.1-0.5 Hz for separate back-
projections. Some stations with high-quality data were not used because they were 
redundant with other nearby high-quality stations. We checked that the array 
response of the final configuration was more compact and symmetric than that
corresponding to using all viable European data (a total of 157 traces). The final 
data set includes traces in the distance range 55° to 95°. The same set of 85 stations 
was used in back-projecting data from the Mw 8.2 event, although static station
corrections were recalculated with MCCC for the initial 10 s of energy in a passband
of 0.2-2.0 Hz, achieving a minimum mean correlation coefficient of 0.6. The
similarity of the Mw 8.2 P waves was degraded by coda from the mainshock and
MCCC analysis on unfiltered traces was not viable. The root-mean-square difference
in the two sets of statics is 0.13 s, and back-projections of the Mw 8.2 event data
using the statics for the Mw 8.7 event give results similar to those shown in this
Letter, although there is a slight southward translation of the beam power.

For the back-projection of F-net data for the Mw 8.7 event, 72 vertical, broadband
channels in the distance range of 36°-62° were downloaded from the NIED data 
centre (www.fnet.bosai.go.jp). The data were examined for quality and ultimately 
67 seismograms, each with a mean cross-correlation value >0.88 from an MCCC 
analysis of unfiltered traces, were selected for back-projection. The aligned data 
were filtered in the passband 0.5-2.0 Hz for stacking. Because of the quasi-regular 
spacing of F-net stations, no seismograms were deleted in an effort to improve the 
array response and equalize data importance. The higher similarity of the Japan P 
waves compared to European P waves is offset by the smaller aperture of F-net and 
we consider the European results to be more robust. The same 67 stations were used 
to back-project the Mw 8.2 event data, but again with statics newly derived from
MCCC analysis of the initial 10 s of P-wave energy. As with the European data,
similarity was lower than for the mainshock, with a minimum mean correlation
coefficient of 0.52, and the root-mean-square difference in the two sets of statics was
0.27 s. Nevertheless, Japan back-projections for the Mw 8.2 data using statics for the
mainshock again give results similar to those shown in this Letter.

Surface wave STF imaging. Surface waves that travel along the short great-circle 
arc (R1 Rayleigh waves and G1 Love waves) from the source region to broadband
seismic stations around the world are used to image the space-time locations of
long-period seismic radiation from the Mw 8.7 and Mw 8.2 events. Group velocity
windows that varied with epicentral distance were used to isolate the fundamental
mode energy from overtones and long-arc arrivals. To remove the complex
dispersion, attenuation, and focusing effects of surface wave propagation, R1
and G1 recordings from an Mw 7.2 earthquake that occurred on 10 January
2012 with very similar location and focal mechanism to the Mw 8.7 mainshock
are used as empirical Green functions (EGFs). We inverted P waves for the EGF
event to ensure that the catalogue focal mechanism was correct and to establish 
that the source process was not anomalous. The individual horizontal component 
instrument responses were deconvolved before rotation for the G1 waves, ensuring
good isolation of the transverse motions. The rotated G1 signals were bandpass
filtered in the passband 0.005-1.0 Hz. We used 525 Rayleigh waves and 485 Love
waves for the final analysis. Only stations at distances less than 150° were used to
avoid contamination from the R2 and G2 arrivals. Visual comparisons of low-pass
filtered traces of all of the EGF and large event signals were made to ensure 
adequate signal coherence and long-period signal stability, retaining stable signals 
away from radiation nodes. The selected EGF signals were deconvolved from the 
corresponding records of the large events using an iterative time-domain deconvolution
procedure with a positivity constraint. This eliminates long-range dispersive
propagation effects and yields relative source time functions (STFs), which
indicate the difference in overall source radiation time history for the large events
relative to the EGF. The relative STFs were then convolved with an estimate of the
STF for the EGF event obtained by finite-fault inversion of P waves, giving absolute 
STFs for the main shock. A 20 s corner low-pass filter was applied to remove short- 
period signal components from all STFs. 
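
To make the idea of iterative time-domain deconvolution with a positivity constraint concrete, the sketch below uses a generic spike-by-spike scheme: at each iteration the EGF is cross-correlated with the current residual, a positive spike is added to the STF at the best-fitting lag, and the residual is updated. This is an assumed, simplified variant and not necessarily the procedure used for the Letter; the wavelet, lags and amplitudes are invented for illustration.

```python
import numpy as np


def iterative_deconvolution(record, egf, n_lags, n_iter=200, tol=1e-3):
    """Estimate a non-negative STF such that record ~ egf convolved with STF.

    record: large-event trace; egf: empirical Green function trace;
    n_lags: length of the STF in samples (must not exceed len(record)).
    """
    record = np.asarray(record, dtype=float)
    egf = np.asarray(egf, dtype=float)
    stf = np.zeros(n_lags)
    residual = record.copy()
    egf_energy = np.dot(egf, egf)
    record_energy = np.dot(record, record)
    for _ in range(n_iter):
        # Cross-correlation of the residual with the EGF at lags 0..n_lags-1.
        full = np.correlate(residual, egf, mode="full")
        xcorr = full[len(egf) - 1 : len(egf) - 1 + n_lags]
        lag = int(np.argmax(xcorr))
        if xcorr[lag] <= 0.0:
            break  # positivity: never add a negative spike
        stf[lag] += xcorr[lag] / egf_energy
        residual = record - np.convolve(egf, stf)[: len(record)]
        if np.dot(residual, residual) <= tol * record_energy:
            break
    return stf


# Toy example with a hypothetical short wavelet and a two-pulse STF.
egf = np.array([0.0, 1.0, -0.6, 0.2])
true_stf = np.zeros(20)
true_stf[3] = 1.0
true_stf[10] = 0.5
record = np.convolve(egf, true_stf)

estimated_stf = iterative_deconvolution(record, egf, n_lags=20)
print(np.round(estimated_stf, 3))
```

Restricting the update to positive spikes stabilizes the result when the EGF is imperfect, but it also means that an overestimated spike cannot be corrected downward later, so real data generally call for smoothing and a careful choice of the stopping tolerance.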

The surface wave STFs were averaged in 10° azimuthal windows, then back- 
projected over a source grid similar to that for the short-period P wave procedure, 
but using average phase velocities of 4.0 km s⁻¹ for R1 and 4.5 km s⁻¹ for G1. These 
phase velocities are appropriate for about 80 s period waves, which are in the centre 
of the passband of the deconvolved signals. There can be some distortion of the 
STFs due to variation in propagation distances for a finite source, and this effect 
increases with source location difference from the EGF location, but the basic 
character of the STFs is preserved by the positivity constraint in the deconvolution. 
In the stacking, we applied azimuthal weighting of the STFs for R1 and G1 
proportional to the sin(2θ) and sin(2θ + 45°) azimuthal radiation patterns for 
strike-slip events, downweighting STFs for near-nodal directions where small errors in 
the EGF can produce unstable STF estimates. STF features that have systematic 
move-out of arrivals at azimuthally distributed stations sum constructively for 
corresponding source space-time locations. This provides images of where long- 
period radiation emanated from the source region independent of assumption of 
any particular fault configuration, although we assume the focal mechanisms are 
consistent with the EGF. The relative stack amplitudes are indicative of relative 
seismic moment of secondary sources in the rupture process, but some uncertainty 
is produced by the possibility of small errors in the EGF geometry between 
subevents, differential path length attenuation effects, and interference with arrivals 
from other subevents. Experiments with simulations indicate good (~10%) recovery 
of localized subevent moment under favourable imaging conditions. 

Finite fault slip model inversion. Given the complex space-time distributions of 
source radiation indicated by the short-period P-wave and long-period R1 and G1 
STF back-projections, along with the correlated aftershock distribution, we specify 
a set of fault segments composed of multiple 10-km-wide by 20-km-long subfaults 
extending from the ocean floor (below a 5-km-thick ocean layer) to ~55-km depth 
that activate at prescribed times (based on the short-period imaging), and invert 
broadband teleseismic P and SH waves for finite fault slip distributions. The 
geometries of the segments are constrained by aftershock distribution and focal 
mechanisms, back-projections, and suites of inversions with varying parameters. 

The rupture initiates at 30 km depth on each fault segment. Resolution of 
hypocentral depths is poor, but the choice was made based on the 30-50 km 
centroid depths found in very long-period (>200 s) point-source inversions for 
this event, suggestive of rupture extending throughout the oceanic lithosphere. We 
specify rupture velocity as 2 km s⁻¹ for each segment guided by the 
back-projections, and allow a flexible parameterization of the source functions on each 
subfault. For fault segments (a, b, c, d, f) each subfault STF is parameterized with 
seven symmetric 3-s rise time triangles offset by 3 s each, allowing total subfault 
durations of 24 s, whereas fault segment e has subfaults with five similar triangles 
with total allowed subfault durations of 18 s. The complexity of slip does not 
provide tight constraints on the subfault durations. 
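
The subfault durations quoted above follow directly from the triangle geometry. The sketch below is illustrative only and is not the inversion code: the function name, the sampling interval and the unit triangle weights are assumptions. It builds a subfault STF as a sum of symmetric triangles (rise time 3 s, so each is 6 s wide) offset by 3 s, and gives a total duration of (n − 1) × 3 s + 6 s.

```python
def subfault_stf(n_triangles, rise=3.0, offset=3.0, dt=0.5, weights=None):
    """Sum of overlapping symmetric triangles (hypothetical helper).

    Each triangle rises for `rise` seconds and falls for `rise` seconds;
    successive triangles start `offset` seconds apart. `weights` are the
    triangle amplitudes that an inversion would solve for (unit by default).
    """
    if weights is None:
        weights = [1.0] * n_triangles
    duration = (n_triangles - 1) * offset + 2.0 * rise
    n_samples = int(round(duration / dt)) + 1
    times = [i * dt for i in range(n_samples)]
    stf = []
    for t in times:
        value = 0.0
        for k, w in enumerate(weights):
            centre = k * offset + rise          # peak time of triangle k
            value += w * max(0.0, 1.0 - abs(t - centre) / rise)
        stf.append(value)
    return times, stf, duration


# Seven triangles (segments a, b, c, d, f) and five triangles (segment e).
_, _, duration_seven = subfault_stf(7)
_, _, duration_five = subfault_stf(5)
print(duration_seven, duration_five)
```

With unit weights the overlapping triangles sum to a flat-topped function; in an inversion each weight would be a free, non-negative parameter.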

Based on many modelling efforts, we specify the rake on each subfault, allowing us 
to invert the body waves and surface wave STFs simultaneously. Fault dip was 
specified after performing inversions for a range of values. We found that allowing 
the dip to vary from the eastern half to the western half of the first fault enabled 
significant improvement in fit to P waveforms at azimuths to the WSW. For most 
fault segments the body wave inversion concentrates seismic moment and, hence, slip 
in the upper three rows of each fault model (depths of 5-35 km below ocean surface), 
spanning the thin oceanic crust and uppermost mantle layer, but deeper slip is found 
on several of the segments, notably segment a. Relative to the P waves, the SH signals 
are weighted by a factor of 0.2 to balance their amplitudes, while relative to the body 
waves the surface wave STFs are weighted by a factor of 5 to ensure good constraint on 
the total moment. The residual waveform mismatch power is 29% for the body waves 
and 19% for the STFs. The final result is a model of time-varying slip on each fault 
segment consistent with the back-projection imaging, aftershocks, and broadband 
teleseismic signals. The many parameters in such a complex model make the finite- 
source kinematic slip inversion even more non-unique than for single fault models, so 
it is best viewed as a plausible realization of overall rupture process for the event. 

The 11 April 2012 east Indian Ocean earthquake 
triggered large aftershocks worldwide 


Fred F. Pollitz, Ross S. Stein, Volkan Sevilgen & Roland Bürgmann 


Large earthquakes trigger very small earthquakes globally during 
passage of the seismic waves and during the following several 
hours to days, but so far remote aftershocks of moment magnitude 
M ≥ 5.5 have not been identified, with the lone exception of 
an M = 6.9 quake remotely triggered by the surface waves from an 
M = 6.6 quake 4,800 kilometres away. The 2012 east Indian Ocean 
earthquake that had a moment magnitude of 8.6 is the largest 
strike-slip event ever recorded. Here we show that the rate of occurrence 
of remote M ≥ 5.5 earthquakes (>1,500 kilometres from the 
epicentre) increased nearly fivefold for six days after the 2012 event, 
and extended in magnitude to M = 7. These global aftershocks were 
located along the four lobes of Love-wave radiation; all struck where 
the dynamic shear strain is calculated to exceed 10⁻⁷ for at least 100 
seconds during dynamic-wave passage. The other M ≥ 8.5 
mainshocks during the past decade are thrusts; after these events, the 
global rate of occurrence of remote M ≥ 5.5 events increased by 
about one-third the rate following the 2012 shock and lasted for 
only two days, a weaker but possibly real increase. We suggest that 
the unprecedented delayed triggering power of the 2012 earthquake 
may have arisen because of its strike-slip source geometry or because 
the event struck at a time of an unusually low global earthquake rate, 
perhaps increasing the number of nucleation sites that were very 
close to failure. 

The 11 April 2012 M = 8.6 east Indian Ocean earthquake (Fig. 1a) is 
by far the largest strike-slip event ever recorded. It was a complex 
event rupturing a series of subparallel and conjugate faults with 
dominant moment release within a ~100-s time span. This 
intra-oceanic earthquake precipitated a large, abrupt increase in 
seismicity worldwide (Fig. 2a). This seismicity reached remote 
distances of 10,000-20,000 km from the mainshock (Fig. 1b, c), 
much wider than the near-field aftershock zone surrounding a large 
mainshock, which is generally a few fault lengths in size. The 
seismicity jump involved increases in earthquake productivity across 
a broad magnitude spectrum (Fig. 3). At rates well above background, 
several M ≥ 5.5 aftershocks were triggered, unlike those in all previous 
remote-triggering cases. 

The global distribution of these remote aftershocks is consistent 
with Love-wave radiation from the mainshock. We consider the square 
root of the second invariant of the deviatoric strain tensor, ε^II, the 
dynamic shear strain realized during passage of the seismic waves, using 
a point source convolved with a 100-s-long source time function 
that replicates observed seismic waveforms (Methods Summary). 


Figure 1 | The 2012 M = 8.6 mainshock and 
M = 8.2 aftershock fault ruptures and maps of 
0.1 microstrain. a, Inferred fault ruptures of the 11 
April 2012 M = 8.6 east Indian Ocean earthquake 
and an M = 8.2 aftershock that occurred 2 h later. 
Superimposed are the first 20 d of M ≥ 4.5 
aftershocks of 0-100-km depth. These earthquakes 
probably ruptured a complex set of subparallel and 
conjugate faults with the indicated sense of 
motion (arrows). Parts of the rupture areas of 
the 2004 M = 9.2 and 2005 M = 8.7 Nias 
earthquakes on the Sunda megathrust are 
indicated. b, c, Global maps of T_strain (colour scale). 
Superimposed are the epicentres of M ≥ 5.5 events 
that occurred during the 6d preceding the 
mainshock (2 epicentres) and following the 
mainshock (24 epicentres, 16 of which are remote, 
that is, >1,500 km from the mainshock). Focal 
mechanisms of six post-mainshock events with 
near-vertical strike-slip mechanisms (plunge of 
neutral axis, >60°) are indicated with red 
beachballs. The 9:00:09 11 April 2012 M=5.5 
event (in the western Aleutian Islands) occurred 
21 min 33 s after the mainshock between the direct 
P- and S-wave arrivals from the mainshock; all 
others are delayed by hours to days. The focal 
mechanism of the mainshock is plotted at its 
epicentre. 

Figure 2 | Global rates of shallow (depth, <100 km) M ≥ 5.5 earthquakes 
during the 10 d preceding and following a mainshock. Events within 
1,500 km of the mainshock are excluded. A running average with a half-width 
of 1 d is used to construct each curve. Time is relative to the mainshock origin 
time. a, 11 April 2012 M = 8.6 east Indian Ocean mainshock. b, A collection of 
five other mainshocks with M ≥ 8.5. Red horizontal lines denote the M ≥ 5.5 
seismicity rate for the 10 d preceding the 2012 mainshock (a) or event-averaged 
rates for the 10 d preceding the five other mainshocks (b). Grey horizontal lines 
and grey shading denote mean M ≥ 5.5 seismicity rates and the 5% and 95% 
empirical probability bounds obtained from analysis of a 20-yr-long NEIC 
catalogue (Supplementary Fig. 4 and Methods section on background 
seismicity rates). Green horizontal lines give the 95% empirical upper bounds 
on R_post for one event (a) or an aggregate of five events (b) derived from a set of 
catalogue M ≥ 7 mainshocks (Supplementary Fig. 7). 


We define a first measure of dynamic strain, the ‘strain duration’ 
T_strain, as the amount of time that ε^II exceeds a threshold value during 
the wave propagation. Figure 1 reveals that M ≥ 5.5 aftershocks lie 
within the four lobes of high T_strain (above a threshold strain of 10⁻⁷, or 
0.1 microstrain, for >100 s), which coincide with the theoretical 
lobes of maximum Love-wave excitation for a mainshock of its 
strike-slip source geometry. A second measure is the peak value of 
ε^II imparted by the mainshock during passage of the seismic waves. 
Maps of this peak (Supplementary Fig. 1) reveal that M ≥ 5.5 
aftershocks similarly lie within the four lobes of peak dynamic shear strain 
(>0.25 microstrain). Both measures are important for rationalizing 
the occurrence of dynamically triggered events. 

These unprecedented observations raise the question of whether 
large aftershocks are always triggered at great distances by large 
mainshocks. Only very small dynamically triggered remote aftershocks of 
M ≥ 7 mainshocks have been found, and M > 5 aftershocks during 
the first ~30 h have been found to be triggered only within ~1,000 km 
from M ≥ 7 mainshocks (generally within the range of static 
Coulomb stress triggering), suggesting that the global hazard of large 
aftershocks does not increase following even a large mainshock. Thus, 
the M = 8.6 east Indian Ocean triggering is probably quite rare. 

Figure 3 | Cumulative number of global M ≥ 4.5 events of depth <100 km 
during the 6 d before and after the 2012 event. Both the pre-mainshock (blue 
symbols) and the post-mainshock (red symbols) seismicities are restricted to be 
remote (>1,500 km from the mainshock). Superimposed is the cumulative 
number of background ‘remote’ events in an average 6-d interval during the 
year preceding the 2012 mainshock (green symbols). Lines illustrate the 
corresponding b values from the Gutenberg-Richter law and standard 
deviations derived using maximum-likelihood regression. 


To confirm that the global rate increase is real, we test the null 
hypothesis that the rate of remote M ≥ 5.5 events is the same for 
periods before and after M ≥ 8.5 mainshocks (Table 1), including 
the 2012 east Indian Ocean earthquake. The remoteness criterion is 
defined with a simple distance threshold from the mainshock centroid 
and is applied uniformly to both pre-mainshock and post-mainshock 
seismicity. Because the enhancement of dynamically triggered seismicity 
is thought to take place on a 1-2-d timescale, we test whether the 
average rate of remote M ≥ 5.5 aftershocks in the 2 d following a 
mainshock, R_post, is the same as the rate, R_pre, of ‘remote’ M ≥ 5.5 events in the 
10 d preceding the mainshock. We use 2 d post-mainshock because of 
the observed duration of the rate increase and 10 d pre-mainshock to 
obtain a reliable background rate. Results are similar using other 
pre-mainshock windows, and here we address the statistical fluctuations in 
pre-mainshock rates to be expected from the chosen time window. We 
define the rate change to be 


$$\Delta R = R_{\mathrm{post}} - R_{\mathrm{pre}} \qquad (1)$$


and test the null hypothesis by comparing the observed ΔR with 
empirical probability distributions of ΔR derived from Monte Carlo 
sampling of the 20-yr global US National Earthquake Information 
Center (NEIC) catalogue of M ≥ 5.5 earthquakes (see Methods sections 
on magnitude of completeness and seismicity rate change). 
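
As a concrete reading of equation (1), the sketch below computes the rate change from a list of remote event times given in days relative to the mainshock origin time. The function name and the example times are hypothetical and are not catalogue data; only the 10-d and 2-d windows come from the text.

```python
def rate_change(event_times_days, pre_window=10.0, post_window=2.0):
    """Return (delta_R, R_pre, R_post) in events per day.

    event_times_days: times of remote events relative to the mainshock
    origin time (negative = before). Assumes the spatial 'remoteness'
    filter has already been applied to the input list.
    """
    n_pre = sum(1 for t in event_times_days if -pre_window <= t < 0.0)
    n_post = sum(1 for t in event_times_days if 0.0 < t <= post_window)
    r_pre = n_pre / pre_window
    r_post = n_post / post_window
    return r_post - r_pre, r_pre, r_post


# Hypothetical example: 10 events in the 10 d before, 6 in the 2 d after.
times = [-9.5 + i for i in range(10)] + [0.2, 0.5, 0.9, 1.2, 1.5, 1.9, 2.5]
delta_r, r_pre, r_post = rate_change(times)
print(delta_r, r_pre, r_post)
```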

There is a fivefold increase in the rate, R, of shallow global M ≥ 5.5 
events for several days following the 2012 mainshock (Fig. 2a). A 
similar but weaker rate change is observed for a collection of five other 
M ≥ 8.5 mainshocks during the past decade (Fig. 2b and Table 1), 
exceeding the 10-d-average pre-mainshock rate by a factor of two 
for 2 d. The jump in seismicity rate is not attributable to induced 
mainshock-aftershock sequences (that is, aftershocks triggering 
aftershocks) and is also apparent over longer time intervals (Supplementary 
Fig. 2). 


Table 1 | M ≥ 8.5 earthquakes 

Date (dd/mm/yyyy)   Magnitude*   Region/name         Tectonic environment   Sense of slip
26/12/2004          9.2†         Sumatra             Sunda megathrust       Thrust + strike slip
28/03/2005          8.7†         Nias                Sunda megathrust       Thrust
12/09/2007          8.5          Sumatra             Sunda megathrust       Thrust
27/02/2010          8.8          Maule, Chile        Andean trench          Thrust
11/03/2011          9.0          Tohoku, Japan       Japan trench           Thrust
11/04/2012          8.6          East Indian Ocean   Intraplate             Strike slip

* From NEIC catalogue unless otherwise noted. 
† Ref. 32. 

The rate change ΔR provides a test of the null hypothesis. As 
summarized in Methods, we consider the set of 243 shallow-focus 
M ≥ 7 mainshocks that occurred during the 20 yr before the 2012 
M = 8.6 event; the magnitude of completeness is 5.5 (Supplementary 
Fig. 3). For a single mainshock, the 95% upper bound on the empirical 
probability distribution of ΔR is 1.25 events per day. The observed rate 
increase for the 2012 M = 8.6 mainshock (Fig. 2a) is ΔR = 2.7 events 
per day, which far exceeds the 95% empirical bound. An increase ΔR 
exceeding 2.5 events per day is not found for any M ≥ 7 mainshock in 
the 1992-2012 catalogue (Supplementary Fig. 5). If the search is 
extended back to 1982, then a ΔR value exceeding 2.5 events per day 
occurs for only one out of 337 M ≥ 7 mainshocks in the catalogue 
(the 5 April 1990 M = 7.4 normal-faulting Mariana Islands 
earthquake). If remote M ≥ 5.5 events are assumed to occur with a 
Poissonian probability distribution, then one out of 337 mainshocks 
translates into a 0.3% probability of the 2012 rate increase occurring 
by chance. 
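
The 0.3% figure is simply the empirical fraction of catalogue mainshocks with a comparably large rate change. A minimal arithmetic check, in which the function name is an assumption and the counts are those quoted above:

```python
def empirical_exceedance(n_exceeding, n_mainshocks):
    """Fraction of mainshocks whose rate change exceeds the chosen level."""
    return n_exceeding / n_mainshocks


p_chance = empirical_exceedance(1, 337)
print(f"chance probability: {100 * p_chance:.1f}%")
print(f"rejection level:    {100 * (1 - p_chance):.1f}%")
```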

For the aggregate of five other M ≥ 8.5 mainshocks, the 95% bound 
on the empirical probability distribution of ΔR is 0.55 events per day 
(Supplementary Fig. 6). The observed rate increase (Fig. 2b) is 
ΔR = 0.63 events per day, a weaker signal than the 2012 rate increase. 
Thus, the null hypothesis can be rejected at the 99.7% level for the 2012 
M = 8.6 mainshock and at the 95% level for the aggregate of five other 
very large mainshocks. 

The global nature of the delayed triggered seismicity following the 
east Indian Ocean earthquake and its association with the Love-wave 
radiation pattern raise a number of issues. The first is whether 
strike-slip earthquakes promote triggering of moderate global aftershocks 
more than thrust sources. The 2012 event is the only M ≥ 8.5 
mainshock with a predominantly strike-slip mechanism (Table 1). A 
strike-slip event will generate horizontally polarized shear-wave (SH- and 
Love-wave) energy in four dominant directions, along and perpendicular 
to the fault strike. Love waves will preferentially stress near-vertical 
strike-slip faults. Six of the 16 (37%) remote aftershocks of the 2012 
event were strike-slip events on near-vertical faults (Fig. 1), whereas 
the background rate of M ≥ 5.5 strike-slip events is 24%; four of the 
strike-slip aftershocks were among the six that occurred during the first 
2 d (67%). In contrast, low-angle thrust events generate predominantly 
P-SV-wave energy with smaller azimuthal variation of radiated 
energy. For example, the 2004 M = 9.2 Sumatra and 2005 M = 8.7 
Nias earthquakes occurred near the 2012 M = 8.6 event but produced 
smaller remote seismic displacements (Supplementary Fig. 10). 
Assuming that dynamic strain scales with displacement, this 
qualitatively suggests an enhanced triggering potential for the 2012 event. 

A strike-slip source geometry, however, is of secondary importance 
to earthquake magnitude. We examined the next-largest strike-slip 
sources of the past 15 yr (the 23 December 2004 M = 8.1 Tasman 
Sea and 28 March 1998 M = 8.1 Antarctic earthquakes, both of which 
were about five times smaller than the 2012 M = 8.6 event), but we do 
not find any increase in remote M ≥ 5.5 seismicity. Nor do we see 
(using the Centennial Earthquake Catalog) remote M ≥ 7 
aftershocks following the 1905 M = 8.4 Mongolian earthquake doublet. 

The triggered aftershocks are not preferentially located in the 
near-field, where dynamic strain magnitudes are high, but rather are 
distributed uniformly over the globe (Fig. 1 and Supplementary Figs 1 
and 8a). The strain duration (Fig. 1) is more uniformly distributed 
globally than the peak strain amplitude (Supplementary Fig. 1). This 
suggests that a dynamic strain threshold enables triggering, as suggested 
previously, and reinforces the notion that dynamic triggering depends 
on not only the amplitude of transient dynamic strains but also the 
duration for which they are applied. Small events triggered by 
passage of seismic waves from the 2011 Tohoku earthquake are also 
distributed globally, with no preference for the near-field region, 
suggesting that dynamic triggering, whether instantaneous or delayed, 
depends on more than dynamic strain amplitude. 

The second issue is the nature of the mechanism behind the 
delayed triggering of large aftershocks. When compared with the 
near-instantaneous surface-wave triggering of small global aftershocks, the 
2-6-d timescale of global large aftershock activity implies a longer 
nucleation process. Near-instantaneous triggering of undetected 
foreshocks or slow-slip transients could initiate a cascade process 
culminating in earthquakes. Surface waves from large earthquakes 
have been found to trigger deep-seated non-volcanic tremor in several 
subduction zones and on the San Andreas fault. This non-volcanic 
tremor is believed to be associated with slow slip. 

The third issue is whether the large jump in global seismicity 
following the 2012 earthquake is related to the very low seismicity rate 
preceding it. The 2012 quake struck after 6-12 d of exceptionally low 
global seismicity (Fig. 4 and Methods section on low pre-earthquake 
seismicity rate). We suspect that because the dynamic stress is 
oscillatory and brief, only faults very close to failure can be triggered. 
If earthquake nucleation sites age, or are stressed towards failure, at a 
roughly constant rate, then when a period of such extremely low 
seismicity coincides with a great mainshock, there is a large reservoir of 
nucleation sites that are critically stressed or very close to failure. In 
contrast, none of the other M ≥ 8.5 earthquakes is preceded by such a 
6-12-d low background rate. Some of the close-to-failure sites may lie 
in aftershock zones of local mainshocks, such as one that is an 
aftershock of the 20 March 2012 M = 7.4 Mexican-trench event (Fig. 1). 

Figure 4 | Global seismicity rates during the 7.3 yr between the 2004 
Sumatra earthquake and the 2012 east Indian Ocean earthquake. 
a, Magnitude of M ≥ 8 events. b, c, Seismicity rates for M ≥ 4.8 events in 6-d 
bins (b) and M ≥ 5.5 events in 12-d bins (c). Catalogue is unedited; for example, 
no 1,500-km exclusion zones are used. 

The strongest evidence for delayed global triggering is provided by 
the preponderance of strike-slip post-mainshock events (Fig. 1), the 
jump in seismicity rate in individual regions (for example the Gulf of 
California (Supplementary Fig. 11)) and the observation that the 
M ≥ 5.5 seismicity rate increase is part of a broader global M ≥ 4.5 
rate increase (Fig. 3), which involves large numbers of remote events 
(114 post-mainshock versus 57 pre-mainshock for 6-d periods). 

The above considerations provide only partial answers to the 
question of why the 2012 event triggered so many remote large 
aftershocks. The apparent dependence of dynamic triggering on period, 
with longer periods having greater influence than shorter periods both 
experimentally and observationally, may be relevant for remote 
triggering because long-period waves undergo less intrinsic 
attenuation than do short-period waves. Directivity effects may also prove 
to be important. Although these and other issues are speculative at 
this stage, the 2012 east Indian Ocean event has already shown that the 
phenomenon of remote earthquake triggering is not restricted to small 
earthquakes or tremor but extends to potentially damaging (M ~ 7) 
earthquakes. This carries implications for the effect of a large 
earthquake on the global seismic hazard, as the 2012 event caused 
strong on-land shaking (with a modified Mercalli intensity greater 
than or equal to seven) in Indonesia, Japan and Mexico. 


METHODS SUMMARY 

Determination of earthquake rates. We consider global rates, R, of shallow 
(depth, ≤100 km) M ≥ 5.5 earthquakes during the 10 d preceding and following 
a given mainshock. These rates exclude pre-mainshock and post-mainshock 
events within a spherical cap of radius 1,500 km centred on the mainshock 
centroid. A running average in time is used to count events. For a target time t_pre 
preceding the event, the events are summed in bins spanning a time interval 
(t_pre − 1 d, min{t_pre + 1 d, 0}), where time is relative to the mainshock origin time. 
For a target time t_post following the event, the events are summed in bins spanning 
a time interval (max{t_post − 1 d, 0}, t_post + 1 d). This procedure is applied to the 11 
April 2012 mainshock or an aggregate of M ≥ 8.5 mainshocks in which the rates 
are divided by the number of mainshocks used. With the above prescription, the 
2-d-average post-mainshock R value is the running-average R value evaluated at 
t = 1 d, for which the window is (0, 2 d); this is the post-mainshock rate, R_post, 
used to define the seismicity rate change in equation (1). 
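
The running-average prescription can be sketched as follows. This is an illustration under stated assumptions, not the authors' code: the function name is hypothetical, and dividing the count by the (possibly truncated) window length to obtain a rate is an assumption, since the text specifies only how events are binned.

```python
def running_rate(event_times_days, t, half_width=1.0):
    """Running-average rate (events per day) at target time t.

    Windows never straddle the mainshock origin time (t = 0):
      t < 0  : (t - half_width, min(t + half_width, 0))
      t >= 0 : (max(t - half_width, 0), t + half_width)
    Assumption: the count is normalized by the actual window length.
    """
    if t < 0:
        lo, hi = t - half_width, min(t + half_width, 0.0)
    else:
        lo, hi = max(t - half_width, 0.0), t + half_width
    count = sum(1 for x in event_times_days if lo < x < hi)
    return count / (hi - lo)


# Hypothetical event times (days relative to the mainshock).
events = [-0.5, 0.5, 1.5, 1.9, 3.0]
r_post = running_rate(events, 1.0)   # window (0, 2 d): the 2-d post-mainshock average
print(r_post)
```

For an aggregate of several mainshocks, the same function would be applied to each event list and the rates averaged over the mainshocks.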

Measure of dynamic strain. Long-period synthetic seismograms of length 2 h 
33 min are calculated at 10-km depth at each of 5,150 points distributed globally 
for a point-source representation of the mainshock convolved with a source 
time function consisting of a cosine ramp function of duration 100 s. They are 
consistent with observed seismograms (Supplementary Fig. 9). At each point, 
seismograms for the six components of the strain tensor are converted into time 
series of ε^II, the square root of the second invariant of the deviatoric strain tensor. 
We define T_strain to be the period during which ε^II is greater than a threshold value 
of 0.1 microstrain, using the time series of ε^II. 
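
A minimal sketch of this strain measure, assuming tensor (not engineering) shear components and the common convention J2 = ½ e'_ij e'_ij for the second invariant of the deviatoric tensor e'; the paper's exact normalization is not stated here, and the function names are hypothetical.

```python
import math


def shear_strain_invariant(strain):
    """sqrt(J2) of the deviatoric strain tensor.

    strain = (exx, eyy, ezz, exy, exz, eyz), tensor components.
    Assumed convention: J2 = 0.5 * sum_ij e'_ij * e'_ij.
    """
    exx, eyy, ezz, exy, exz, eyz = strain
    mean = (exx + eyy + ezz) / 3.0
    dxx, dyy, dzz = exx - mean, eyy - mean, ezz - mean
    j2 = 0.5 * (dxx**2 + dyy**2 + dzz**2) + exy**2 + exz**2 + eyz**2
    return math.sqrt(j2)


def strain_duration(strain_series, dt, threshold=1e-7):
    """Total time (s) for which sqrt(J2) exceeds the threshold.

    strain_series: sequence of six-component strain samples spaced dt seconds
    apart; the default threshold is 0.1 microstrain.
    """
    n_above = sum(1 for s in strain_series if shear_strain_invariant(s) > threshold)
    return n_above * dt


# Hypothetical pure-shear time series sampled at 1 s.
series = [(0, 0, 0, x, 0, 0) for x in (0.5e-7, 1.5e-7, 2e-7, 0.8e-7, 3e-7)]
print(strain_duration(series, dt=1.0))
```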


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS 


Magnitude of completeness. The magnitude of completeness M_c of the NEIC 
catalogue has evolved over time. Supplementary Fig. 3 shows cumulative 
seismicity rates and the corresponding b values determined over three consecutive 
10-yr time periods. The 1982-1992 seismicity rates are generally lower than rates 
since 1992. The global M ≥ 5.5 seismicity rates have been essentially stable since 
1992, the 1992-2002 rates diverging from the 2002-2012 values only for M < 5.5. 
This, together with the estimated b values and the quality of the linear fits, leads 
us to conclude that M_c = 5.5 is appropriate for the NEIC catalogue since 1992. A 
value of M_c = 4.8 is appropriate for the period between the 2004 Sumatra mainshock 
and the 2012 mainshock (see section on low pre-earthquake seismicity rate). 
Comparison of regional seismicity catalogues with the NEIC catalogue suggests 
that a value of M_c = 4.5 is appropriate for at least the past year. 

Background seismicity rates. For the six M ≥ 8.5 mainshocks of the past 10 yr, 
the M ≥ 5.5 global seismicity rates presented in Fig. 2 are determined by excluding 
spherical caps of radius 1,500 km about a given mainshock epicentre for a 
particular time window. This procedure, applied equally to the pre-mainshock 
and post-mainshock periods, effectively removes the vast majority of aftershocks 
following the mainshocks, leaving only the remote seismicity in both the 
pre-mainshock and post-mainshock periods. In the main text, this procedure 
was applied separately to the specified periods preceding and following each 
mainshock, resulting in running-average remote seismicity rates from 10 d before 
to 10 d after a mainshock as well as 10-d pre-mainshock average remote seismicity 
rates. The latter may be considered ‘background’ remote seismicity rates, but the 
question arises as to the level and statistical fluctuations of this background using a 
longer catalogue. 

Here we address the statistics of background seismicity rates using longer time 
intervals and with a Monte Carlo simulation approach. For this purpose, we use 
the NEIC catalogue for the period April 1992 to April 2012, extracting all events 
with M ≥ 5.5 and depth ≤100 km. To render the estimation of background rates 
comparable with pre-mainshock rates presented in the main text, it is appropriate 
to de-cluster the catalogue. In doing so, we are guided by the fact that none of the 
six very large mainshocks was preceded by an event of magnitude M > 8.1 during 
the preceding 20 d. (The 26 December 2004 M = 9.2 Sumatra earthquake was 
preceded by the 23 December 2004 M = 8.1 Tasman Sea event.) Our strategy 
for de-clustering is to extract from the catalogue all sets of 10-d-long periods 
(t − 10 d, t) such that no event of magnitude greater than M_min occurs in the 
20 d preceding t. This is designed to remove time intervals that contain or are 
preceded by large-magnitude events. Compared with Gardner and Knopoff 
de-clustering, it uses a more restrictive spatial exclusion zone (infinite distance) 
but a less restrictive temporal exclusion zone. In order to mimic the sampling 
process of five very large mainshocks (that is, all the M ≥ 8.5 events excluding the 
2012 event) in one realization, five of these sets are drawn randomly and the rates 
of M ≥ 5.5 events occurring in them averaged. That is, if there are N available 10-d 
sets, then one realization yields the estimate 

$$R = \frac{1}{5} \sum_{i=1}^{5} R_{\nu(i)}$$

where R_n is the average rate in set n and ν(i) is a random variable uniformly 
distributed over the integers 1 through N. This process is repeated 100,000 times 
to generate a probability distribution of R. 
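
The sampling scheme just described can be sketched as below. The function names, the fixed seed and the example rates are assumptions for illustration; the 10-d window rates themselves would come from the de-clustered catalogue. Draws are made with replacement, consistent with each index being uniformly distributed over 1 through N.

```python
import random


def sample_mean_rates(window_rates, n_draw=5, n_realizations=100_000, seed=0):
    """Monte Carlo distribution of the mean rate over n_draw random windows.

    window_rates: average rate (events per day) in each available 10-d set.
    Each realization draws n_draw sets uniformly, with replacement.
    """
    rng = random.Random(seed)
    n = len(window_rates)
    return [
        sum(window_rates[rng.randrange(n)] for _ in range(n_draw)) / n_draw
        for _ in range(n_realizations)
    ]


def empirical_bounds(samples, lower=0.05, upper=0.95):
    """Empirical lower and upper probability bounds from sorted samples."""
    ordered = sorted(samples)
    last = len(ordered) - 1
    return ordered[int(lower * last)], ordered[int(upper * last)]


# Hypothetical window rates, not catalogue values.
rates = [0.6, 0.9, 1.0, 1.1, 1.3, 1.4, 1.8]
low, high = empirical_bounds(sample_mean_rates(rates, n_realizations=10_000))
print(low, high)
```

Setting `n_draw=1` corresponds to the single-mainshock case, in which every available 10-d average is treated as one realization.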

Resulting probability distributions of R are shown in Supplementary Fig. 4. 
There is a decrease in the mean rate of M ≥ 5.5 earthquakes from R = 1.20 events 
per day in the unedited catalogue (Supplementary Fig. 4a) to R = 1.12 events per 
day in the de-clustered catalogue with M_min = 8.1 (Supplementary Fig. 4b). The 
value R = 1.12 and corresponding 5% and 95% empirical probability bounds of 
0.82 and 1.47 events per day define the grey region plotted in Fig. 2b. 

The entire procedure may be repeated by sampling all available 10-d averages 
once, that is, 


$$R = R_n \quad (n = 1, \ldots, N)$$


to replicate the averaging process used for the 2012 mainshock alone. The resulting 
R and corresponding 5% and 95% empirical probability bounds for this case define 
the grey region plotted in Fig. 2a. 

Seismicity rate change. The question of the significance of the observed 
short-term rate increases in remote seismicity following M ≥ 8.5 mainshocks (Fig. 2) was 
previously addressed over essentially all distance ranges using a large set of M ≥ 7 
mainshocks. To answer it we must characterize the probability distribution of the 
change in the remote seismicity rate, with ‘remoteness’ being referenced to a set of 
M ≥ 7 mainshocks. In doing so, our approach is similar to that of ref. 11, but we 
focus on the remote seismicity rate change. 


We again use the NEIC catalogue for the period April 1992 to April 2012, 
extracting all events with M ≥ 5.5 and depth ≤100 km. There are 243 M ≥ 7 mainshocks 
during this period. For each mainshock of M ≥ 7 considered separately, we define a 
spherical cap of angular radius 1,500 km about its epicentre and exclude all events 
within that zone. We may further de-cluster this ‘unedited catalogue’ according to 
the prescription given in the preceding section: for a given M_min, mainshocks with 
any M ≥ M_min event within the 20 d preceding them are excluded. We then evaluate the 
average seismicity rate in the preceding 10 d (R_pre) and the following 2 d (R_post), and 
define the seismicity rate change ΔR = R_post − R_pre (equation (1) of main text). To 
mimic the sampling process of one or five very large mainshocks, we follow the 
Monte Carlo approach described in the preceding section. 
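
The spherical-cap exclusion amounts to a great-circle distance test. The sketch below uses the haversine formula on a spherical Earth of radius 6,371 km; both the formula choice and the function names are assumptions, and the coordinates in the example are hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean radius; spherical-Earth assumption


def great_circle_km(lat1, lon1, lat2, lon2):
    """Haversine great-circle distance between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2.0 * EARTH_RADIUS_KM * math.asin(min(1.0, math.sqrt(a)))


def remote_events(events, mainshock, cap_km=1500.0):
    """Keep events lying outside the exclusion cap around the mainshock.

    events: iterable of tuples whose first two entries are (lat, lon).
    mainshock: (lat, lon) of the mainshock epicentre or centroid.
    """
    lat0, lon0 = mainshock
    return [e for e in events if great_circle_km(lat0, lon0, e[0], e[1]) > cap_km]


# Hypothetical epicentres about 556 km and 2,224 km from a mainshock at (0, 0).
print(remote_events([(0.0, 5.0), (0.0, 20.0)], (0.0, 0.0)))
```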

Resulting probability distributions of ΔR for one mainshock are shown in 
Supplementary Fig. 5. In the two considered cases, there is a statistically 
insignificant average rate decrease (ΔR = −0.03 events per day for the unedited 
catalogue in Supplementary Fig. 5a; ΔR = −0.06 events per day for the 
de-clustered catalogue in Supplementary Fig. 5b). The 95% upper bound on the 
empirical probability distribution for a rate increase for a single mainshock is 
1.25 events per day for both the unedited and the de-clustered catalogues. 
Similarly, the 95% upper bound on the empirical probability distribution for a 
rate increase for an aggregate of five mainshocks is 0.55 events per day for both the 
unedited and the de-clustered catalogues (Supplementary Fig. 6). 

It is of interest to understand the ranges of Rpre and Rpost separately as well as
their covariance for the considered set of M ≥ 7 mainshocks. Supplementary
Fig. 7a shows the distributions of Rpre and Rpost obtained through the above
Monte Carlo scheme for a single event with Mmin = 8.1. The 95% empirical upper
bound on Rpost, equal to 2.75 events per day, is much higher than the corresponding
upper bound on ΔR, equal to 1.25 events per day (Supplementary Fig. 5b). The
primary reason is that the means of Rpre and Rpost are almost identical, such that
the distribution of ΔR is centred on a near-zero mean (Supplementary Fig. 5b). In
addition, Rpre and Rpost are moderately correlated, such that the variance of their
difference is slightly smaller than the variance of either quantity alone. The same is
true for the 95% upper bound on Rpost for an aggregate of five events (Supplementary
Fig. 7b), equal to 1.65 events per day, which is much higher than the corresponding
upper bound on ΔR, equal to 0.55 events per day (Supplementary Fig. 6b).
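
The effect of the correlation between the pre- and post-mainshock rates can be made explicit with the standard identity for the variance of a difference. The equal-variance case in the second line is an illustrative assumption, not a result stated in the text; ρ denotes the correlation coefficient of the two rates.

```latex
\begin{align}
\operatorname{Var}(\Delta R)
  &= \operatorname{Var}(R_{\mathrm{post}}) + \operatorname{Var}(R_{\mathrm{pre}})
     - 2\,\operatorname{Cov}(R_{\mathrm{post}}, R_{\mathrm{pre}}) \\
  &= 2\sigma^{2}(1-\rho)
     \quad \text{if } \operatorname{Var}(R_{\mathrm{post}}) = \operatorname{Var}(R_{\mathrm{pre}}) = \sigma^{2}.
\end{align}
```

Under that assumption the variance of the difference falls below the variance of either rate alone only when ρ exceeds 1/2.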
Distance dependence of global seismicity. We characterize the global seismicity
following the six very large mainshocks in terms of the distances of M ≥ 5.5 events
from the respective mainshock epicentres. This is displayed graphically in
Supplementary Fig. 8 in terms of event rates per unit area in logarithmic distance
bins for each of three post-mainshock time intervals and a single pre-mainshock
interval. For both the 11 April 2012 M = 8.6 mainshock and the other five M ≥ 8.5
mainshocks, post-mainshock global seismicity rates are systematically higher than
pre-mainshock rates at remote distances, that is, well beyond the near-field
aftershock zone.

Calculation of the cumulative number of M ≥ 4.5 remote events. We extract
from the NEIC catalogue all events of depth ≤ 100 km and M ≥ 4.5, and construct
three subsets of events: those in the 6 d preceding the 2012 mainshock, those in the
6 d following the 2012 mainshock and those within 6-d intervals following M ≥ 7
events during the year preceding the 2012 event. To restrict attention to remote
events, we exclude all events lying within a spherical cap of radius 1,500 km from
the 2012 centroid (first and second cases) or a given M ≥ 7 mainshock (third case,
that is, background ‘remote’ events). The cumulative numbers of events as
functions of magnitude for these three cases are compared in Fig. 3.
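
A minimal sketch of the remote-event selection and the cumulative count follows. It assumes a spherical Earth of radius 6,371 km, a haversine great-circle distance and events stored as dictionaries with `lat`, `lon` and `mag` keys; these choices and all names are illustrative rather than the authors' implementation.

```python
import math

EARTH_RADIUS_KM = 6371.0  # assumed mean Earth radius


def great_circle_km(lat1, lon1, lat2, lon2):
    """Haversine distance in km between two points given in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dlat = p2 - p1
    dlon = math.radians(lon2 - lon1)
    h = math.sin(dlat / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlon / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(min(1.0, math.sqrt(h)))


def remote_events(events, centre_lat, centre_lon, cap_radius_km=1500.0):
    """Keep only events lying outside the spherical cap around the centre."""
    return [e for e in events
            if great_circle_km(e["lat"], e["lon"], centre_lat, centre_lon) > cap_radius_km]


def cumulative_counts(magnitudes, thresholds):
    """Number of events with magnitude >= each threshold."""
    return [sum(1 for m in magnitudes if m >= th) for th in thresholds]
```

Applying `cumulative_counts` to the magnitudes of each of the three remote subsets gives curves of the kind compared in Fig. 3.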

Smaller mainshock-aftershock sequences. Both pre-mainshock and
post-mainshock rates in Fig. 2 may be influenced by clustering from smaller
mainshock-aftershock sequences. To assess the remote seismicity pattern
without such clustering, we remove smaller mainshock-aftershock
sequences. We do so by applying an additional filter when recalculating the 2-d
running-average seismicity rates, further excluding any event that is preceded by an
M ≥ 5.5 event during the previous 24 h and within a 100-km radius. Resulting
seismicity rates for the 30-d periods preceding and following the M ≥ 8.5
mainshocks are shown in Supplementary Fig. 2. The ~2-d heightened remote
seismicity following all mainshocks is seen to be anomalous over the entire time
period, showing that these increased seismicity rates are not attributable to
induced smaller mainshock-aftershock sequences.
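
The additional filter can be sketched as follows. The sketch assumes events are dictionaries with `time` (days), `lat`, `lon` and `mag` keys, uses a haversine distance on a spherical Earth, and tests each event against all earlier events in the catalogue, including ones that are themselves removed; these are illustrative assumptions, since the text does not specify such details.

```python
import math

EARTH_RADIUS_KM = 6371.0  # assumed mean Earth radius


def distance_km(a, b):
    """Haversine distance in km between two event dictionaries."""
    lat1, lat2 = math.radians(a["lat"]), math.radians(b["lat"])
    dlat = lat2 - lat1
    dlon = math.radians(b["lon"] - a["lon"])
    h = math.sin(dlat / 2) ** 2 + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(min(1.0, math.sqrt(h)))


def drop_local_aftershocks(events, trigger_mag=5.5, window_days=1.0, radius_km=100.0):
    """Remove events preceded by a nearby M >= trigger_mag event within the window."""
    ordered = sorted(events, key=lambda e: e["time"])
    kept = []
    for i, event in enumerate(ordered):
        triggered = False
        for prior in reversed(ordered[:i]):
            lag = event["time"] - prior["time"]
            if lag > window_days:
                break  # all remaining earlier events are older still
            if (lag > 0 and prior["mag"] >= trigger_mag
                    and distance_km(prior, event) <= radius_km):
                triggered = True
                break
        if not triggered:
            kept.append(event)
    return kept
```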

Low pre-earthquake seismicity rate. Seismicity rates were exceptionally low in
the days before the 2012 mainshock. To demonstrate this, we divided the NEIC
catalogue into 6-d bins or 12-d bins, which yields sufficient sampling of M ≥ 4.8
and M ≥ 5.5 events, respectively. We measured the completeness level of the bins
from the 2004 Mw = 9.2 Sumatra mainshock to the 2012 east Indian Ocean
mainshock; Mc ≤ 4.8 for 95% of the 6-d bins. Using either the M ≥ 4.8 or the M ≥ 5.5
threshold, we find that the number of events during the bin preceding the 2012
mainshock is the ninth lowest at M ≥ 4.8 and the lowest at M ≥ 5.5 for a 7.3-yr
period (Fig. 4). In other words, respectively 98.4% or 100% of the bins had a higher
background rate, such that the global seismicity rates preceding the 2012 event
were exceedingly low compared with the prevailing rates over these 7.3 yr.
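
The bin comparison can be sketched in a few lines. The sketch assumes origin times in days, contiguous equal-width bins starting at `start`, and ranking by raw event counts; the function names are hypothetical and the handling of ties is an assumption.

```python
def bin_counts(times, start, end, width):
    """Count events in contiguous bins [start + k*width, start + (k+1)*width)."""
    n_bins = int((end - start) // width)
    counts = [0] * n_bins
    for t in times:
        k = int((t - start) // width)
        if 0 <= k < n_bins:
            counts[k] += 1
    return counts


def fraction_of_bins_higher(counts, target_index):
    """Fraction of the other bins whose count exceeds that of the target bin."""
    target = counts[target_index]
    others = [c for i, c in enumerate(counts) if i != target_index]
    return sum(1 for c in others if c > target) / len(others)
```

Here `target_index` would be the bin immediately preceding the mainshock, and the returned fraction corresponds to the percentage of bins with a higher background rate.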

Calculation of global seismic wavefields. Seismic wavefields are calculated using
the direct Green’s function method of ref. 34 on the spherically symmetric
isotropic PREM model. Global seismic wavefields are calculated using a
point-source approximation of the 11 April 2012 mainshock convolved with a source
time function consisting of a cosine ramp function of duration 100 s, and low-pass
filtered at 80 s. The source epicentre, depth and moment tensor we use are those of
the Global CMT solution: 2.24°N, 92.78°E; 40 km; Mrr = 0.136 × 10^29 dyn cm,
Mθθ = −0.591 × 10^29 dyn cm, Mφφ = 0.455 × 10^29 dyn cm, Mrθ = −0.396 × 10^29 dyn cm,
Mrφ = 0.046 × 10^29 dyn cm, Mθφ = −0.615 × 10^29 dyn cm. Although the actual rupture is
much more complex, at these long periods our simple model captures the
first-order character of the global seismic wavefield (Supplementary Fig. 9).

Closing yield gaps through nutrient and water management


Nathaniel D. Mueller1, James S. Gerber1, Matt Johnston1, Deepak K. Ray1, Navin Ramankutty2 & Jonathan A. Foley1


In the coming decades, a crucial challenge for humanity will be meeting
future food demands without undermining further the integrity
of the Earth’s environmental systems. Agricultural systems are
already major forces of global environmental degradation, but
population growth and increasing consumption of calorie- and
meat-intensive diets are expected to roughly double human food
demand by 2050 (ref. 3). Responding to these pressures, there is
increasing focus on ‘sustainable intensification’ as a means to
increase yields on underperforming landscapes while simultaneously
decreasing the environmental impacts of agricultural systems.
However, it is unclear what such efforts might entail for the future of
global agricultural landscapes. Here we present a global-scale assessment
of intensification prospects from closing ‘yield gaps’ (differences
between observed yields and those attainable in a given region),
the spatial patterns of agricultural management practices and yield
limitation, and the management changes that may be necessary to
achieve increased yields. We find that global yield variability is
heavily controlled by fertilizer use, irrigation and climate. Large
production increases (45% to 70% for most crops) are possible from
closing yield gaps to 100% of attainable yields, and the changes to
management practices that are needed to close yield gaps vary
considerably by region and current intensity. Furthermore, we find that
there are large opportunities to reduce the environmental impact of
agriculture by eliminating nutrient overuse, while still allowing an
approximately 30% increase in production of major cereals (maize,
wheat and rice). Meeting the food security and sustainability
challenges of the coming decades is possible, but will require considerable
changes in nutrient and water management.

Opportunities for agricultural intensification were analysed for
seventeen major crops, which covered approximately 76% of global
harvested cropland area between 1997 and 2003 (Food and
Agriculture Organization of the United Nations). Yield gaps (Fig. 1)
were estimated by comparing landscape-level observed yields to
‘attainable yields’, determined by identifying high-yielding areas
within zones of similar climate. As empirical estimates, attainable
yields are more conservative than absolute biophysical ‘potential
yields’, but they are probably achievable using current technology
and management techniques.
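
One way to read ‘identifying high-yielding areas within zones of similar climate’ is sketched below. It assumes that the attainable yield of a climate zone is a high percentile of the observed yields in that zone; the percentile value, the nearest-rank rule, the lack of area weighting and all names are assumptions made for illustration, not the procedure documented by the authors.

```python
import math
from collections import defaultdict


def percentile(values, q):
    """Nearest-rank percentile (q in percent) of a non-empty list."""
    ordered = sorted(values)
    rank = max(1, math.ceil(q * len(ordered) / 100))
    return ordered[rank - 1]


def attainable_yields(cells, q=95.0):
    """Map each climate zone to an assumed attainable yield (q-th percentile)."""
    by_zone = defaultdict(list)
    for cell in cells:
        by_zone[cell["zone"]].append(cell["yield"])
    return {zone: percentile(ys, q) for zone, ys in by_zone.items()}


def yield_gap(observed, attainable):
    """Fraction of the attainable yield that is not yet achieved."""
    return 1.0 - min(observed / attainable, 1.0)
```

A grid cell yielding 4.5 tonnes per hectare in a zone whose attainable yield is 9 tonnes per hectare would then have a yield gap of 50%.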

Considerable yield-improvement opportunities exist relative to 
current attainable yield ceilings, with opportunities differing dramatically 
by crop and geography (regional and country-specific data for all 
seventeen crops are summarized in the Supplementary Information). 
Globally, we find that closing yield gaps to 100% of attainable yields 
could increase worldwide crop production by 45% to 70% for most major 
crops (with 64%, 71% and 47% increases for maize, wheat and rice, 
respectively). Eastern Europe and Sub-Saharan Africa show considerable 
‘low-hanging’ intensification opportunities for major cereals (Fig. 2); 
these areas could have large production gains if yields were increased 
to only 50% of attainable yields. East and South Asia also have substantial 
intensification opportunities owing to their vast agricultural lands and 
the geographic variability in their yields and yield gaps. 
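
The production increase from closing yield gaps to a given fraction of attainable yields can be illustrated with a short sketch. It assumes each grid cell carries a harvested area, an observed yield and an attainable yield, and that cells already above the target keep their observed yield; the data layout and names are hypothetical.

```python
def production(cells, fraction=None):
    """Total production; optionally raise yields to fraction * attainable."""
    total = 0.0
    for cell in cells:
        y = cell["yield"]
        if fraction is not None:
            # Cells already above the target are left unchanged.
            y = max(y, fraction * cell["attainable"])
        total += cell["area"] * y
    return total


def relative_gain(cells, fraction):
    """Relative production increase from closing gaps to the given fraction."""
    return production(cells, fraction) / production(cells) - 1.0
```

For two cells of 100 ha at 2 t/ha and 50 ha at 6 t/ha, both with an attainable yield of 8 t/ha, closing gaps to 50% raises production by 40% and closing them to 100% raises it by 140%.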


Assessing opportunities for more sustainable intensification
requires an understanding of the factors driving yield variation across
the world. Fundamentally, yield gaps are caused by deficiencies in the
biophysical crop growth environment that are not addressed by
agricultural management practices. Here we explicitly examined key
biophysical drivers of crop yield by using global, crop-specific irrigation
data and by developing a new global, crop-specific data set of
nitrogen (N), phosphate (P2O5) and potash (K2O) fertilizer application
rates. We find extensive geographic variation in these management
practices, with high fertilizer application rates concentrated in
high-income and some rapidly developing countries (Fig. 3a and
Supplementary Fig. 1). Likewise, irrigated areas are heavily
concentrated in South Asia, East Asia and parts of the United States (Fig. 3b).

Using input-yield crop models, we found that the spatial patterns of
climate, fertilizer application and irrigated area explain 60% to 80% of
global yield variability for most major crops (Supplementary Information
and Supplementary Table 1). Yields of some crops (for example,
sorghum, millet and groundnut) were primarily controlled by climate,
whereas others (for example, barley, sugar beet and oil palm) showed
strong management responses. Surprisingly, model residuals showed
little sensitivity to soil and slope parameters (Supplementary Information
and Supplementary Fig. 2), suggesting that such relationships are
obscured on the landscape scale with existing data sets.

The factors that primarily limit increasing crop yields to within 75% 
of their attainable yields (Fig. 4, Supplementary Fig. 3) vary by crop 
and region. For example, Eastern Europe and West Africa stand out as 
hotspots of nutrient limitation for maize, whereas Eastern Europe 
seems to experience nutrient limitation for wheat. Co-limitation of 
nutrients and water is observed across East Africa and Western 
India for maize, portions of the US Great Plains and the 

Figure 1 | Average yield gaps for maize, wheat and rice. These were
measured as a percentage of the attainable yield achieved circa the year 2000. 
Yield gap in each grid cell is calculated as an area-weighted average across the 
crops and is displayed on the top 98% of growing area. 


1Institute on the Environment (IonE), University of Minnesota, St. Paul, Minnesota 55108, USA. 2Department of Geography and Global Environmental and Climate Change Center, McGill University, Montreal, Quebec H3A 2K6, Canada.


254 | NATURE | VOL 490 | 11 OCTOBER 2012 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER

[Figure 2 bar chart. Legend: production circa the year 2000; increased production with yield gaps closed to 50%, 75%, 90% and 100% of attainable yields. Horizontal axis, region and crop (M, maize; W, wheat; R, rice): North America; Latin America and Caribbean; Western Europe; Eastern Europe and Central Asia; Middle East and North Africa; Sub-Saharan Africa; South Asia; East Asia; Southeast Asia and Oceania. Vertical axis ticks: 0.05 to 0.25.]


Figure 2 | Global production increases for maize, wheat and rice from 
closing yield gaps to 50%, 75%, 90% and 100% of attainable yields. The 
greatest opportunities for increases in absolute production (from closing yield 
gaps to 100% of estimated attainable yields) are wheat (W) in Eastern Europe 
and Central Asia, rice (R) in South Asia and maize (M) in East Asia. Absolute 


Mediterranean Basin for wheat, and in Southeast Asia for rice. We note
that the management practices that limit yield increases depend on the
degree of yield-gap closure desired (Supplementary Fig. 4). For
example, closing maize yield gaps to 50% of attainable yields
(approximately 2.5 tonnes per hectare) in Sub-Saharan Africa primarily
requires addressing nutrient deficiencies (Supplementary Fig. 4a),
but closing yield gaps to 75% of attainable yields (approximately 3.6
tonnes per hectare) requires increases in both irrigated area and
nutrient application over most of the region (Fig. 4a).

We examined potential changes in irrigated area and nutrient
application that are needed to close yield gaps of maize, wheat and
rice to within 75% of attainable yields (a 29% global production increase)
using our input-yield models. On the landscape scale, yield gaps in
co-limited regions can be closed through a range of irrigated-area and
nutrient-intensity combinations (see Supplementary Fig. 5). For
example, 73% of these underachieving areas could close yield gaps
by solely focusing on nutrient inputs (with 18%, 16% and 35%
increases in N, P2O5 and K2O application relative to baseline global
consumption, respectively), whereas only 16% of underachieving areas
could close yield gaps by solely increasing irrigation. Jointly increasing
irrigated area and nutrient application could close yield gaps on all
underachieving areas (with 30%, 27% and 54% increases in N, P2O5
and K2O application, respectively, and a 25% increase in irrigated
hectares; Fig. 5a, b).


production increases for individual crops in Sub-Saharan Africa are smaller 
owing to lower attainable yields and diverse cropping systems (that is, less area 
devoted to any one crop). The region could still achieve large production 
increases in cassava, maize and sugarcane. 


To minimize the environmental impacts of intensification, increased
irrigation and nutrient application to close crop yield gaps should be
complemented by efforts to decrease overuse of crop inputs wherever
possible; combined, these efforts could increase total food production
while decreasing the overall global use of water and nutrients. For
example, we estimate that by addressing imbalances and inefficiencies,
nitrogen- and phosphate-fertilizer application on maize, wheat and rice
could decrease globally by 11 million tonnes of nitrogen (28%) and
5 million tonnes of phosphate (38%) without impacting current yields
(Supplementary Fig. 6). Nutrient overuse on these crops is particularly
dramatic in China, confirming field-scale results. To close yield gaps to
75% of attainable yields while also eliminating input overuse (under joint
nutrient and irrigation intervention), we project that smaller net changes
in nutrient inputs would be required: 9%, −2% and 34% changes in N,
P2O5 and K2O application (Fig. 5c and Supplementary Fig. 7). Notably, it
would be possible to close global yield gaps on major cereals to within
75% of attainable yields with fairly minimal changes to total worldwide
nitrogen and phosphate use by coupling targeted intensification with
efforts to reduce nutrient imbalances and inefficiencies. Geographically
optimizing input intensity and increasing field-scale efficiencies (beyond
the average efficiencies implicit in our input-yield models) could
improve production further relative to inputs.

Closing yield gaps may not always be desirable or practical in the 
short term, given marginal returns for additional inputs, regional 


[Figure 3 maps. Panel a colour scale: major cereals, average N fertilizer application rate (kg N ha^-1), 0 to 200. Panel b colour scale: major cereals, area irrigated (%), 0% to 100%.]

Figure 3 | Management intensity of nitrogen fertilizer and irrigated area
varies widely across the world’s croplands. a, b, Fertilizer (a) and irrigation
(b) values are area-weighted averages across major cereals.
[Figure 4 map legend, repeated for panels a to c: nutrient limited; nutrient and irrigation limited; 75% attainable yields achieved.]

Figure 4 | Management factors limiting yield-gap closure to 75% of 
attainable yields for maize, wheat and rice. a, b, c Yield-limiting management 
factors for maize (a), wheat (b) and rice (c) were calculated using the suite of 
input-yield models, comparing current input intensity against estimated 
required levels to close yield gaps. 


land-management policies, limits on sustainable water resources and
socio-economic constraints (for example, access to capital, infrastructure,
institutions and political stability). However, use of precision
agriculture techniques, conservation tillage, high-yielding hybrids,
increased plant populations and multifunctional landscape management
can help to mitigate negative environmental impacts of intensive
agriculture. Additionally, use of organic fertilizers (omitted in this
analysis owing to data limitations) is essential for improving soil
carbon, enhancing soil biota and increasing water-holding capacity.
Social triggers of intensification will differ across regions; examples
include development interventions by governments or NGOs,
market-driven incentives for farmer investment, and land scarcity in
regions not fully connected to global markets.

Changes to agricultural management to close yield gaps should be 
considered in the context of climate change, which is expected to 
substantially impact yields and induce management adaptations.
Specifically, a major concern is how changes in water availability may 
conflict with projected irrigation requirements for closing yield gaps. 

Figure 5 | Closing yield gaps through changes in agricultural management.
a, b, Projected increases in nitrogen application rates (a) and irrigated areas 
(b) necessary to close maize, wheat and rice yield gaps to 75% of attainable 
yields. c, Projected net changes in nitrogen application rates when closing yield 
gaps and eliminating input imbalances and inefficiencies. 


The fertilizer data set, yield gap estimates and yield models presented
here could be used widely to assess intensification opportunities
and the environmental impacts of changing agricultural systems.
However, these data and analyses are not without limitations (full
discussion in Supplementary Information). Most importantly, the
analyses rely on agricultural management, yield and climate data from
a variety of different sources and on different scales. Overall, these
results are most useful across regional and global scales, leaving
fine-scale and temporal details obscured (for example, intra- and
inter-annual variation in climate and yield creates particular uncertainty
about irrigation requirements). Moreover, although our models
confirm the importance of climate, fertilizers and irrigation in determining
contemporary patterns of global cropland productivity, we do
not discount the importance of additional biophysical characteristics
(including soil characteristics, see Supplementary Information) and
management practices (including crop rotation patterns, organic
nutrient inputs, micronutrients, improved seed quality, conservation
tillage and pest management). Incorporating these factors into the
analytical framework could improve the accuracy and utility of the
analyses. Additional research on cropland intensification must also
assess the opportunities and environmental tradeoffs for increasing
cropping intensity and decreasing pre- and post-harvest crop losses.

The future of agriculture faces two great challenges: substantial
increases in food demand must be met while decreasing agriculture’s
global environmental footprint. Closing yield gaps and increasing
resource efficiency are necessary strategies towards meeting these
challenges, but they must be combined with efforts to halt agricultural
expansion, reduce food waste, promote sensible diets and produce
advanced crop varieties. This analysis emphasizes the crucial role of
nutrient and water management in pathways towards sustainable
intensification, and provides a starting point for a more comprehensive
discussion of intensification opportunities and challenges.
Context-dependent policies and agricultural development programs must
address drivers of yield limitation while encouraging management
practices that improve tradeoffs between production and
environmental impacts.


METHODS SUMMARY 


Yield gaps were quantified by comparing existing yields to climate-specific
attainable yields. Our approach refines previous estimates by excluding climate
outliers and using crop-specific, equal-area climate zones.

Fertilizer application rate and consumption data were compiled for nations and 
subnational units across the globe (Supplementary Table 2). Application rates for 
crop-country combinations missing data were estimated as described in the 
Supplementary Information. Crop- and crop-group-specific application rates 
were then distributed across detailed maps of crop and pasture areas, and rates
were harmonized with subnational and national nutrient consumption data. 

Fertilizer and irrigation data were used to parameterize nutrient response curves 
and rainfed maximum yields, using nonlinear regression analyses within each 
climate zone. Using these relationships, we estimated changes in inputs necessary 
to close yield gaps, as well as decreases in inputs possible from addressing 
inefficiencies and imbalances. 
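
How a fitted response curve translates into a required input change can be sketched as follows. The text states only that nutrient response curves were fitted by nonlinear regression; the saturating exponential (Mitscherlich-type) form used here, its parameters `y_max`, `b` and `c`, and the function names are assumptions made purely for illustration.

```python
import math


def response(n, y_max, b, c):
    """Assumed saturating response: yield as a function of nutrient rate n."""
    return y_max * (1.0 - b * math.exp(-c * n))


def input_for_target(target, y_max, b, c):
    """Nutrient rate at which the assumed curve reaches the target yield."""
    if not 0.0 <= target < y_max:
        raise ValueError("target must lie in [0, y_max)")
    shortfall = 1.0 - target / y_max
    if shortfall >= b:
        return 0.0  # target is already met without any nutrient input
    return -math.log(shortfall / b) / c


def extra_input(current_n, target, y_max, b, c):
    """Additional nutrient needed beyond the current rate (never negative)."""
    return max(0.0, input_for_target(target, y_max, b, c) - current_n)
```

With hypothetical parameters `y_max = 10`, `b = 0.8` and `c = 0.02`, reaching 75% of the maximum yield requires the rate returned by `input_for_target(7.5, 10, 0.8, 0.02)`, and `extra_input` gives the increase relative to the current application rate.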

Complex brain and optic lobes in an early Cambrian arthropod


Xiaoya Ma, Xianguang Hou, Gregory D. Edgecombe & Nicholas J. Strausfeld


The nervous system provides a fundamental source of data for understanding
the evolutionary relationships between major arthropod
groups. Fossil arthropods rarely preserve neural tissue. As a result,
inferring sensory and motor attributes of Cambrian taxa has been
limited to interpreting external features, such as compound eyes or
sensilla decorating appendages, and early-diverging arthropods
have scarcely been analysed in the context of nervous system evolution.
Here we report exceptional preservation of the brain and optic
lobes of a stem-group arthropod from 520 million years ago (Myr
ago), Fuxianhuia protensa, exhibiting the most compelling neuroanatomy
known from the Cambrian. The protocerebrum of
Fuxianhuia is supplied by optic lobes evidencing traces of three
nested optic centres serving forward-viewing eyes. Nerves from uniramous
antennae define the deutocerebrum, and a stout pair of
more caudal nerves indicates a contiguous tritocerebral component.
Fuxianhuia shares a tripartite pre-stomodeal brain and nested optic
neuropils with extant Malacostraca and Insecta, demonstrating
that these characters were present in some of the earliest derived
arthropods. The brain of Fuxianhuia impacts molecular analyses
that advocate either a branchiopod-like ancestor of Hexapoda or
remipedes and possibly cephalocarids as sister groups of Hexapoda.
Resolving arguments about whether the simple brain of a branchiopod
approximates an ancestral insect brain or whether it is
the result of secondary simplification has until now been hindered
by lack of fossil evidence. The complex brain of Fuxianhuia accords
with cladistic analyses on the basis of neural characters, suggesting
that Branchiopoda derive from a malacostracan-like ancestor but
underwent evolutionary reduction and character reversal of brain
centres that are common to hexapods and malacostracans. The
early origin of sophisticated brains provides a probable driver for
versatile visual behaviours, a view that accords with compound eyes
from the early Cambrian that were, in size and resolution, equal to
those of modern insects and malacostracans.

Fuxianhuia protensa is one of more than 80 arthropod species
retrieved from the Chengjiang biota (Cambrian Series 2, Stage 3) at
various sites in Yunnan Province, southwest China. Fossils of
Fuxianhuia are commonly preserved dorsoventrally flattened (Fig. 1).
Dorsal aspects show a ‘head shield’ comprising two hemi-shields
together covering the anteriormost four segments, the first of which
bears a pair of postocular uniramous antennae (Fig. 2a). Ventrally, the
first post-antennal segment provides paired curved articulated structures
that have been identified either as appendages flanking the
mouth or as gut diverticuli. A thorax comprises up to 16 segments,
with fewer tergites than biramous appendages. The thoracic
segments have broad paratergal folds. An abruptly narrowed abdomen
consists of 13 segments, and a terminal spine is flanked by a pair of
shorter spines. The body of F. protensa is thus extremely simple, yet
evidences obvious tagmosis (Fig. 1). Frontally, each of a pair of eyes is
situated at the end of a short wide stalk that broadens medially to
merge with a rectangular optic capsule (Fig. 2a, c), sometimes claimed
as pre-segmental, that lies at the midline beneath a forward-thrusting
rostrum, also referred to as an anterior sclerite. Paired protrusions,
situated medial to each eye and extending forward dorsally, were once
interpreted as a second or ancillary pair of eyes. Although in some
fossils these protrusions are the same colour as the lateral eye (Fig. 3a),
in others the colouration matches that of the rostrum (Fig. 3c). No
corneal facets are preserved and, accordingly, we interpret these structures
as lateral extensions of the rostrum and not compound eyes.
Observations of the tergites in specimens preserved ventral side up show
little indication of sensory bristles or setae on their surfaces, whereas
thoracic endopods and the paired antennae show occasional insertion
points of sensory setae on each article and annulus (Supplementary
Fig. 1a). Comparable sensilla have been identified in the Burgess Shale
arthropod Waptia fieldensis, indicating abundant sensory inputs to
segmental ganglia.

As is typical of Chengjiang fossils, internal organs of Fuxianhuia show 
exceptional preservation, delineated by contours, as in the digestive tract, 
and often by coloured delineations from diagenetic mineralization that 
effectively stains or outlines preserved internal structures. These can
include segmentally arranged digestive glands/diverticuli, retinas and 


Figure 1 | Fuxianhuia protensa from the Chengjiang Lagerstätte. Dorsal
view of complete specimen, YKLP 11321. A1, antenna; Ab, abdomen; Es, eye
stalk; Ey, eye; Hs, head shield; Oc, optic capsule; Th, thorax. Scale bar, 1 cm.

lenses, and, in rare instances, traces of muscle and nervous system. This
last structure is arguably the densest internal tissue in any extant arthropod,
comprising vast numbers of lipid profiles, cytoskeletal proteins
and mitochondria. Profiles of internal structures, matching the profiles
of contemporary brains and ganglia, have previously been identified in
the Burgess Shale and Chengjiang, belying the common supposition
that neural tissue is too prone to decay to withstand fossilization.

Observations of some 50 specimens, many newly collected and stored 
at the Yunnan Key Laboratory for Palaeobiology (YKLP), demonstrate 
well-preserved eyes and eye stalks in various orientations (Figs 2c and 3), 
suggesting that the eye stalk assemblage possessed a considerable degree 
of rotational freedom and thus allowed active vision (Supplementary 
Fig. 3). For example, the eye stalk in Fig. 3a is tilted forwards, such that 
the eye appears as viewed from directly above, demonstrating a much 
larger radius of curvature frontally than laterally. Such differences of 
curvature are commonplace in extant insects and crustaceans, the flatter 
surfaces allowing small divergence angles between adjacent ommatidia 
and thus greater resolution of the frontal visual field than the lateral 
one. In the closely allied Chengjiangocaris longiformis, the medial protrusion
and the lateral eye base are associated with a Y-shaped structure
on the ventral side that possibly afforded mechanical support and articulation
(Fig. 3b). Eyes viewed from their frontal aspect (that is, preserved
tilted upwards as in Fig. 3) show the presence of a compound eye

Figure 2 | Brain of Fuxianhuia protensa and comparison with Malacostraca.
a–c, YKLP 15006. a, Head region, showing paired eyes (Re)
and an antenna (A1). b, Detail of brain.
Arrowheads indicate paired darker regions at level
of entry of antennal nerves. c, Optic capsule, dark
areas (arrowed 1-3) corresponding to nested optic
neuropils. Arrows leading from asterisks indicate
contiguous optic tract remnants, including
heterolateral component within main body.
d, Reconstructed brain of Fuxianhuia showing
organization of optic lobes, optic tracts and
peripheral nerve tracts A1n and A2n.
e, Reconstructed brain of the land hermit crab
Coenobita clypeatus (ref. 25) showing homologies
in Malacostraca. Central projections of tracts from
optic stalks, including heterolateral connections
between the stalks, are shown as an intermediate
shade of blue; in Fuxianhuia heterolateral
connections are provided by a separate tract
extending between the optic tracts. f, Top-down
view of brain of glass shrimp, Palaemonetes pugio.
Transparent cuticle shows three nested neuropils
(arrowed 1-3) in eye stalks. Protocerebrum
denoted by its paired lobes (pr lo). Black
arrowheads indicate olfactory lobes. Nerve roots of
A1n and A2n to right. A1n, antennal nerve; A2n,
second pair of nerve roots; op t, optic tract; pr lo,
protocerebral lobes. Scale bars, 1 mm.


comprising facets. A thin opalescent layer (Fig. 3d, e) overlies two deeper 
stratifications interpreted as corresponding to dioptric arrangements: 
corneal lenses surmounting crystalline cones. Patches of lenses identified 
in exceptionally preserved specimens resolve a regular arrangement 
of 25-µm-diameter lenses on both the lateral and frontal eye margins
(Fig. 3f). These layered arrangements correspond to those observed in 
malacostracan crustaceans (Fig. 3g). 

One specimen (YKLP 15006), previously described in ref. 12, shows
exceptionally preserved traces of tissue within the eye stalks, and tract-like
extensions from these to a bilaterally symmetric area situated frontally,
beneath the head shield (Fig. 2a), identified as the brain. Its shape
(Fig. 2b) matches that of a comparable-sized modern malacostracan
(Fig. 2f). The brain in Fig. 2b shows paired anterior protocerebral lobes,
behind which the tracts from the eye stalks converge. Caudal to these
emerge robust traces of a pair of antennal nerves, one of which (A1n)
can be followed into the antenna (Fig. 2b). Immediately behind these
nerves is a third pair of stout tracts. These do not emanate from
the antennae, but probably belong to post-antennal appendages. The
presence of these paired nerve roots (A2n) supports observations
identifying the paired sickle-shaped structures flanking the mouth
ventrally as appendicular. The brain of YKLP 15006 shows definitive
bilateral symmetry both in its outlines and in the three pairs of nerves
supplying it. A pair of symmetrically positioned darker textured

11 OCTOBER 2012 | VOL 490 | NATURE | 259 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


Figure 3 | Eyes of fuxianhuiids and comparison to Malacostraca. a, Paired 
eyes of Fuxianhuia protensa, YKLP 11322. Top-down view of left eye stalk 
showing retina (Re), flanking protrusion and midline rostral sclerite 
(arrowheads). Dark areas (arrowed 1-3) suggest three nested neuropils. 

b, Ventral view of eye of Chengjiangocaris longiformis, RCCBYU 10256, 
showing Y-shaped sclerite extending from ventral surface of first segment. 
c-f, Fuxianhuia protensa, YKLP 11323. c, Exposed interior of eye stalk, 


domains is observed within the brain at the level of entry of the antennal 
nerves (Fig. 2b). The position of these paired regions best corresponds 
to the location of paired olfactory lobes observed in the deutocerebrum 
of numerous crown-group malacostracans. 

The eye stalks of YKLP 15006 and other specimens contain alter- 
nating swollen and constricted patches of pigmentation that are inter- 
preted as volumes of optic neuropils. Their identity as neural tissue is 
substantiated by the continuity of the optic tract with the anterior part 
of the brain. Three successive domains of pigmentation are indicative 
of three nested neuropils (numbered as in Figs 2c and 3c and 
Supplementary Figs 1 and 2). Depending on the orientation of the 
head, these domains can appear to be positioned towards the rear 
margin of the eye stalk (Fig. 2c) or further forward when the eyes 
and head capsule are rotated upwards around the horizontal axis 
(Fig. 3c and Supplementary Fig. 2a, b). Neural tissue is shown as dark 
material (Fig. 2a-c) or the profiles of optic regions and optic tracts may 
be the only areas lacking diagenetic deposits, as shown symmetrically 
in both eye stalks in Fig. 3c (Supplementary Fig. 2a, b). In YKLP 15006, 
the putative optic neuropils are contiguous with darkly coloured optic 
nerves that extend caudally to eventually merge into the brain area at 
the level of its anterior protocerebral lobes (Fig. 2c and Supplementary 
Fig. 1). Scanning electron microscopy and energy-dispersive X-ray 
spectroscopy show that both the brain and the optic lobes are preserved 
with higher iron concentration than other surrounding structures, 
including the cuticle (Supplementary Fig. 4). 

The presence of three successive pairs of nerve roots, corresponding 
to the optic nerves, paired uniramous antennae and evidently to paired 
post-antennal appendages, demonstrates that the brain of Fuxianhuia 
comprises three neuromeres: a protocerebrum, deutocerebrum and 
tritocerebrum. It is not possible to ascertain whether the entire trito- 
cerebral ganglion lies in front of the stomodeum or whether, as in 
resolving three raised, nested areas. d, Right eye margin showing opalescent 
layer interpreted as the corneal surface (0). e, Left eye, two layers beneath outer 
layer correspond to the lens layer (ll) and crystalline cone (cc), together providing 
the dioptric layer. f, Left eye and its margins showing regularly spaced structures 
corresponding to lenses. g, Silver-stained lateral eye margin of a malacostracan 
(Lebbeus groenlandicus) showing argyrophilic lens layer beneath a thin corneal 
layer and a layer of crystalline cones. Scale bars, a-d 250 µm, e-h 50 µm. 


many extant pancrustaceans, the tritocerebrum is penetrated by the 
stomodeum, with one part of it pre-stomodeal and the other part post- 
stomodeal. In other mandibulate arthropods, such as centipedes, the 
tritocerebrum flanks the gut, but its neuropil is contiguous with, not 
separate from, the deutocerebrum. In no taxon is the tritocerebrum 
wholly pre-oral: in cases where the entire tritocerebrum appears to lie 
forward of the gut, there is still one bundle of post-stomodeal axons 
that connects both sides of the tritocerebrum. 

The brain of Fuxianhuia is thus tripartite, as are brains of Malacostraca, 
Chilopoda and Insecta, but not those of Branchiopoda, 
in which the tritocerebral ganglion is separate from the deutocerebrum 
and situated post-orally. Likewise, the indication of three nested optic 
neuropils in Fuxianhuia (Fig. 2) contrasts with Branchiopoda, which 
have only two such neuropils linked by uncrossed axons, rather than 
three nested neuropils linked by chiasmata, as in malacostracans, 
including Phyllocarida, and insects (Fig. 2f). The reconstructed 
brain of Fuxianhuia (Fig. 2d) thus corresponds in many respects to 
that of malacostracans (Fig. 2e). 

The crucial differences between the branchiopod and malacostra- 
can central nervous system are evidenced in other important ways. 
Branchiopoda lack olfactory lobes and hemiellipsoid bodies, both 
of which typify not only Malacostraca but also Remipedia and 
Copepoda. Even Cephalocarida possess glomerular antennal lobes 
that are connected to paired mushroom bodies, the homologues of 
hemiellipsoid bodies. The elaborate mid-line neuropils of Malacostraca, 
consisting of a protocerebral bridge and bilateral central 
body, contrast with the simpler central body in Branchiopoda. The 
simpler arrangement of two retinotopic neuropils connected by 
uncrossed axons has been used, classically, to advocate branchiopods 
as basal within Pancrustacea (=Tetraconata sensu ref. 30), diverging 
before a more elaborate tripartite brain and additional optic neuropil 
(the lobula) evolved in Malacostraca and Insecta. If Branchiopoda 
indeed show plesiomorphic character states, fossil brains from deep 
time ought to provide supporting evidence. Here, observations of the 
fossilized brain of a putative stem-group arthropod, Fuxianhuia, 
instead provide evidence of a tripartite brain anterior to the stomodeum, 
supplied by optic nerves originating from lobes that appear to 
comprise three nested neuropils, indicating that, if homologous, these 
characters may pre-date the most recent common ancestor of extant 
Pancrustacea. 

The absence of specialized head appendages posterior to the paired 
curved post-antennal appendages and the structure of the trunk 
appendages (with simple exopod flaps and no differentiation of a 
protopodite) have argued for a position of Fuxianhuia in the arthropod 
stem-group; that is, branching earlier than the most recent common 
ancestor of Chelicerata and Mandibulata (phylogeny as in ref. 10). 
The brain of Fuxianhuia is more readily compared to that of mandi- 
bulates than to chelicerates, which, apart from Araneae, have only one 
bona fide retinotopic neuropil and lack reversing chiasmata and optic 
lobes per se. Spider visual systems entirely separate the colour-sensitive 
and colour-insensitive pathways and have convergently evolved two 
nested retinotopic neuropils for each eye type, plus two deeper structures 
that are most similar to central complex neuropils and mushroom 
bodies. 

A position of Fuxianhuia in the arthropod stem-group requires che- 
licerate neural characters to be simplified compared with an ancestor 
that possessed the shared features of Fuxianhuia and crown pancrus- 
taceans, excluding Branchiopoda, and it implies that the ground pattern 
organization of the crustacean brain evolved very early. Alternatively, it 
could imply that the malacostracan-like characters of Fuxianhuia are 
convergent with those of Pancrustacea/Tetraconata. In either case, 
the brain and optic lobes of Fuxianhuia suggest that the arthropod 
nervous system acquired complexity by the early Cambrian. Indeed, 
it is expected that optic lobes would have already evolved sophisticated 
circuits even more deeply in the arthropod stem-group, enabling high- 
level visual processing of the kind presumed to be associated with large 
compound eyes belonging to the stem-group arthropod Anomalocaris. 


METHODS SUMMARY 


All studied specimens are deposited in the Yunnan Key Laboratory for Palaeo- 
biology, Yunnan University, Kunming, China. Specimens were prepared with fine 
needles under high magnification using stereomicroscopes. Digital images of 
the fossils were captured using a Nikon SMZ1000 photomicroscope and were 
processed in Adobe Photoshop CS5. Figure 2f was taken using a Zeiss Stemi 
SV11 photomicroscope equipped with a Jenoptic ProgRes C3 digital camera. 
Image enhancement of lens periodicity (Fig. 3f) was obtained by superimposing 
two identical images of the eye, colour substituted for contrast, and using the 
darkening function to shift the top image upwards by a distance equal to the width 
of one lens. The image of Fig. 3a had green tones removed to accentuate contrast of 
coloured structures against the matrix. 
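
The lens-periodicity enhancement described above can be sketched in a few lines. This is only an illustration under stated assumptions: the "darkening function" is taken to be a darken blend (the per-pixel minimum of the two layers), and the function name `enhance_periodicity` and the parameter `lens_width_px` are hypothetical, not part of the published workflow.

```python
import numpy as np

def enhance_periodicity(image: np.ndarray, lens_width_px: int) -> np.ndarray:
    """Superimpose an image on a copy of itself shifted upwards by one lens
    width and combine the two layers with a darken blend (assumed here to be
    the per-pixel minimum)."""
    if not 0 < lens_width_px < image.shape[0]:
        raise ValueError("lens_width_px must be between 1 and image height - 1")
    shifted = np.empty_like(image)
    # Shift upwards: row r of the top layer shows row r + lens_width_px.
    shifted[:-lens_width_px] = image[lens_width_px:]
    # Rows with no source data keep the original values, so the minimum
    # leaves them unchanged.
    shifted[-lens_width_px:] = image[-lens_width_px:]
    return np.minimum(image, shifted)
```

When the shift equals the true spacing, dark lens boundaries land on top of one another and the pattern is reinforced; a mismatched shift fills the gaps between boundaries instead.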


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


All studied specimens are deposited in the Yunnan Key Laboratory for Palaeo- 
biology, Yunnan University, Kunming, China. Specimens were prepared with fine 
needles under high magnification using stereomicroscopes. Digital images of the 
fossils were captured using a Nikon SMZ1000 photomicroscope and were processed 
in Adobe Photoshop CS5. Figure 2f was taken using a Zeiss Stemi SV11 photomicroscope 
equipped with a Jenoptic ProgRes C3 digital camera. Image enhancement 
of lens periodicity (Fig. 3f) was obtained by superimposing two identical 
images of the eye, colour substituted for contrast, and using the darkening function 
to shift the top image upwards by a distance equal to the width of one lens. The image 
of Fig. 3a had green tones removed to accentuate contrast of coloured structures 
against the matrix. 

Scanning electron microscopy and energy-dispersive X-ray spectroscopy 
were performed with a Zeiss (LEO) 1450 VP-SEM and an Oxford Instruments 
X-Max 80 mm² Silicon Drift Detector (SDD) with Oxford Instruments INCA 
microanalysis system. The INCA software PhaseMap option was used to obtain 
a tri plot of Si, Fe and Al. Binary maps were obtained from areas with a high 
iron concentration (red) and high silicon + aluminium concentration (blue). 
These maps were then combined arithmetically, as shown in Supplementary 
Fig. 4c, f, i. 
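
The construction of the binary element maps can be sketched as follows. This is a minimal illustration, not the INCA PhaseMap procedure itself: the thresholds `fe_thresh` and `sial_thresh`, the function name `phase_maps` and the choice of an RGB sum as the "arithmetic" combination are all assumptions.

```python
import numpy as np

def phase_maps(fe, si, al, fe_thresh, sial_thresh):
    """Return binary maps for high Fe and high Si + Al, plus an RGB image in
    which the Fe map is placed in the red channel and the Si + Al map in the
    blue channel (thresholds are hypothetical)."""
    fe = np.asarray(fe, dtype=float)
    sial = np.asarray(si, dtype=float) + np.asarray(al, dtype=float)
    fe_mask = fe >= fe_thresh          # iron-rich pixels
    sial_mask = sial >= sial_thresh    # silicon + aluminium-rich pixels
    rgb = np.zeros(fe.shape + (3,), dtype=np.uint8)
    rgb[..., 0] = np.where(fe_mask, 255, 0)    # red: high Fe
    rgb[..., 2] = np.where(sial_mask, 255, 0)  # blue: high Si + Al
    return fe_mask, sial_mask, rgb
```

Pixels that pass both thresholds appear magenta in the combined image, and pixels that pass neither stay black.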
Dopaminergic neurons inhibit striatal output 
through non-canonical release of GABA 


Nicolas X. Tritsch, Jun B. Ding & Bernardo L. Sabatini 


The substantia nigra pars compacta and ventral tegmental area 
contain the two largest populations of dopamine-releasing neurons 
in the mammalian brain. These neurons extend elaborate projec- 
tions in the striatum, a large subcortical structure implicated in 
motor planning and reward-based learning. Phasic activation of 
dopaminergic neurons in response to salient or reward-predicting 
stimuli is thought to modulate striatal output through the release of 
dopamine to promote and reinforce motor action. Here we show 
that activation of dopamine neurons in striatal slices rapidly inhi- 
bits action potential firing in both direct- and indirect-pathway 
striatal projection neurons through vesicular release of the inhibitory 
transmitter GABA (γ-aminobutyric acid). GABA is released 
directly from dopaminergic axons but in a manner that is inde- 
pendent of the vesicular GABA transporter VGAT. Instead, 
GABA release requires activity of the vesicular monoamine trans- 
porter VMAT2, which is the vesicular transporter for dopamine. 
Furthermore, VMAT2 expression in GABAergic neurons lacking 
VGAT is sufficient to sustain GABA release. Thus, these findings 
expand the repertoire of synaptic mechanisms used by dopamine 
neurons to influence basal ganglia circuits, show a new substrate 
whose transport is dependent on VMAT2 and demonstrate that 
GABA can function as a bona fide co-transmitter in monoaminer- 
gic neurons. 

The striatum integrates inputs from cortex, hippocampus, thalamus, 
amygdala and ventral tegmental area/substantia nigra pars compacta 
(VTA/SNc) to instruct the selection of appropriate motor actions. 
Inputs from midbrain dopamine (DA) neurons play an important role 
in this process, as evidenced by the psychomotor deficits that arise after 
loss of these cells in Parkinson’s disease, or by the occurrence of com- 
pulsive and addictive behaviours upon potentiation of dopaminergic 
signalling. Through release of DA, these neurons promote activation 
of direct-pathway striatal projection neurons (dSPNs), which express 
Gαs/olf-coupled D1 receptors, and inhibit indirect-pathway SPNs 
(iSPNs), which express Gαi/o-coupled D2 receptors. However, midbrain 
DA neurons also express neuropeptides and a subset releases 
glutamate, suggesting that the net effects of activity in these cells 
may not be limited to the actions of DA. 

To investigate how DA neurons influence neuronal activity in striatum, 
we expressed the light-activated cation channel channelrhodopsin-2 
(ChR2) in SNc neurons using Cre recombinase-dependent 
adeno-associated viruses (AAVs). In Slc6a3^IRES-Cre/wt mice, Cre 
expression shows high penetrance and specificity for midbrain DA 
neurons (Supplementary Fig. 1). We validated SNc targeting using 
an AAV encoding Cre-dependent enhanced green fluorescent protein 
(EGFP). Fluorescence was restricted to Cre-containing neurons in 
SNc, and most EGFP-expressing cells (97.9%, n = 1587 cells; three 
mice) were immunopositive for the catecholamine biosynthetic 
enzyme tyrosine hydroxylase, demonstrating specific expression in 
DA neurons (Fig. 1a, b and Supplementary Figs 1 and 2). Moreover, 
EGFP+ axons densely innervated dorsal striatum (Fig. 1c), consistent 
with their nigrostriatal identity. 


Carbon-fibre amperometry confirmed the ability to evoke DA 
release from ChR2-expressing axons in slices of dorsal striatum. Brief 
flashes of blue light (1 ms) reliably evoked DA transients, which were 
sensitive to the tyrosine hydroxylase antagonist α-methyl-tyrosine and 
Figure 1 | DA neuron stimulation inhibits SPNs. a, Coronal midbrain 
section from a Slc6a3^IRES-Cre/wt mouse transduced with a Cre-dependent EGFP 
AAV (green). DA neurons immunolabelled for tyrosine hydroxylase (TH, red). 
DAPI nuclear stain (blue). b, Higher magnification of boxed area in a. Although 
expression levels vary greatly, most EGFP+ cells are tyrosine-hydroxylase-positive. 
c, EGFP+ SNc neurons densely innervate dorsal striatum (dStr). vStr, 
ventral striatum. d, Carbon-fibre recording configuration in a sagittal brain 
slice. ChR2+ DA neurons depicted red, laser illumination area blue. e, Left, light 
stimulation (blue) of DA terminals in dorsal striatum evoked cadmium-, 
reserpine-, TBZ- and α-methyl-tyrosine (α-MT)-sensitive DA release 
measured by amperometry. Stimulation artefacts blanked for clarity. Right, 
mean (n = 4-19) peak extracellular DA concentration after optogenetic 
stimulation of nigrostriatal axons. DA transients recovered upon TBZ washout 
(wash). *P < 0.05 versus ACSF (Mann-Whitney rank sum test). f, g, Left, 
membrane potential responses of a dorsal striatum dSPN (f) or iSPN (g) to 
current injection (1.3 s, bottom) and ChR2-mediated stimulation of DA axons 
(1 ms, blue). Right, average normalized duration of three consecutive interspike 
intervals (Norm. ISI) that precede, straddle or follow the light flash (positions 
‘−1’, ‘0’ and ‘+1’, respectively) upon ChR2 stimulation in ACSF (blue), DA 
receptor blockers (D1/2R antagonists = SCH23390 + SKF83566 + sulpiride + 
L-741,626; green) or SR95531 (GABAA receptor blocker; purple). Black, light 
prevented from entering the sample; n = 14 dSPNs, 11 iSPNs. *P < 0.05 (two-way 
ANOVA). Data in e-g represent mean ± s.e.m. 
Figure 2 | DA neurons directly release GABA onto SPNs. a, Evoked current 
responses from an iSPN held at indicated potentials (Vh) to optogenetic 
activation of nigrostriatal axons (1 ms, blue) upon sequential bath application 
of SR95531 and NBQX/CPP. b, As in a for a dSPN with antagonists applied in 
reverse order. c, Mean IPSC (red) and EPSC (grey) absolute amplitudes in 
dSPNs (n = 8) and iSPNs (n = 21). *P < 0.05 versus IPSC (Mann-Whitney 
rank sum test). d, Normalized IPSCs and EPSCs from a (dark) and b (light) 
shown on an expanded timescale. Blue, light presentation. e, Average (n = 29 
SPNs) IPSC (red) and EPSC (grey) latencies from light onset to current onset 
(circle), half maximal amplitude (triangle) and peak amplitude (square). IPSCs 
are not delayed compared with EPSCs. f, g, Mean (n = 3-18) EPSC (f) and 
IPSC (g) amplitudes under control conditions (ACSF) or in indicated 
antagonists normalized to baseline. *P < 0.05 versus ACSF (Mann-Whitney 
rank sum test). h, Amperometric DA transients (black) and current responses 
of a voltage-clamped iSPN (Vh = 0 mV, red; Vh = −70 mV, grey) to DA 
neuron stimulation under baseline conditions (left), in TTX (middle) and after 
co-application of TTX and 4-aminopyridine (4AP, right). i, Extracellular DA 
concentration (black), IPSC amplitude (red) and EPSC amplitude (grey) 
evoked by ChR2 stimulation across conditions normalized to baseline. Data 
from ACSF condition same as in Figs 1e and 2f, g. *P < 0.05 versus ACSF; 
**P < 0.05 versus TTX (Mann-Whitney rank sum test). Error bars, s.e.m. 


the VMAT2 antagonists reserpine and tetrabenazine (TBZ) (Fig. 1d, e). 
Moreover, DA release required Ca2+ channels, as it was blocked 
by cadmium, and activation of D2 receptors with quinpirole reduced 
DA release, consistent with the effect of DA autoreceptors in nigrostriatal 
axons. 

To determine the net effect of DA neuron stimulation on striatal output, 
we performed whole-cell current-clamp recordings from dSPNs and 
iSPNs in brain slices obtained from Slc6a3^IRES-Cre/wt;Drd2-EGFP 
mice. Whereas action potentials evoked by current steps occurred at 
regular intervals under baseline conditions, light presentation reliably 
paused firing and rapidly hyperpolarized SPNs of both populations 
(dSPNs: 6.8 ± 1.9 mV, n = 10; iSPNs: 7.0 ± 1.4 mV, n = 7) (Fig. 1f, g). 
The light-evoked pause and hyperpolarization were unaffected by a 
cocktail of antagonists targeting DA receptors (dSPNs: 6.6 ± 2.2 mV, 
n = 4; iSPNs: 9.5 ± 2.5 mV, n = 4; both P > 0.05 versus light only, 
Mann-Whitney rank sum test). Instead, they were abolished by the 
GABAA receptor antagonist SR95531 (n = 4 dSPNs, seven iSPNs) 
(Fig. 1f, g), which does not alter DA release (Fig. 1e). These data 
indicate that DA neurons exert a rapid and strong inhibitory influence 
on SPNs through activation of GABAA receptors. 
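
The pause in firing is quantified in Fig. 1f, g as the normalized duration of the interspike intervals (ISIs) that precede, straddle and follow the light flash. A minimal sketch of that measurement is given below. The normalization to the ISI preceding the flash is an assumption, because the reference interval is not stated in this section, and the function name `normalized_isis` is hypothetical.

```python
import numpy as np

def normalized_isis(spike_times_s, flash_time_s):
    """Return the three consecutive interspike intervals at positions
    -1 (before), 0 (straddling) and +1 (after) the light flash, each divided
    by the interval at position -1 (assumed normalization)."""
    spikes = np.sort(np.asarray(spike_times_s, dtype=float))
    # Index of the first spike that occurs after the flash.
    k = int(np.searchsorted(spikes, flash_time_s, side="right"))
    # Two spikes are needed before the flash and two after it.
    if k < 2 or k + 1 >= spikes.size:
        raise ValueError("need at least two spikes on each side of the flash")
    pre = spikes[k - 1] - spikes[k - 2]
    straddle = spikes[k] - spikes[k - 1]
    post = spikes[k + 1] - spikes[k]
    return np.array([pre, straddle, post]) / pre
```

A value well above 1 at position 0, with values near 1 at positions -1 and +1, corresponds to a transient pause in otherwise regular firing.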

Previous attempts at characterizing DA receptor-independent 
effects of DA neurons on SPNs showed a small, but rapid excitatory 
influence mediated by co-release of glutamate. However, these 
experiments were performed in the presence of GABA receptor 
antagonists, precluding the detection of inhibitory influences. To 
observe conductances recruited after DA neuron stimulation, we per- 
formed whole-cell voltage-clamp recordings from dSPNs and iSPNs in 
dorsal striatum without pharmacological blockers. When SPNs were 
held at −70 mV (ECl, the reversal potential for chloride), nigrostriatal 
fibre stimulation evoked fast inward currents in approximately 75% 
of dSPNs (n = 6/8) and iSPNs (n = 16/21) (Fig. 2a-e). These currents 
showed similar properties in both cell types (Fig. 2a-c) and were consequently 
pooled for analysis: they exhibited peak amplitudes of 
40 ± 5 pA (n = 22), rise times of 2.6 ± 0.4 ms and decay time constants 
of 6.8 ± 1.5 ms. Moreover, they reversed at approximately 
0 mV and were sensitive to the α-amino-3-hydroxy-5-methyl-4-isoxazole 
propionic acid (AMPA) and NMDA (N-methyl-D-aspartate) 
receptor blockers NBQX and CPP (Fig. 2a, b, f), indicating that they 
are glutamatergic excitatory postsynaptic currents (EPSCs). Although 
glutamate release by midbrain DA neurons was proposed to be limited 
to mesolimbic fibres innervating ventral striatum, the currents 
reported here in dorsal striatum show similar properties, and are 
sensitive to the same pharmacological agents as DA release (Figs 1e 
and 2f), suggesting a dopaminergic origin. Moreover, EPSCs were 
unaffected by DA receptor antagonists (Fig. 2f), indicating that 
glutamate release is not secondary to DA receptor activation. 
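
Holding the cell at the chloride reversal potential isolates the glutamatergic current because no net chloride current flows there. The Nernst relation behind that reversal potential can be sketched as follows. The chloride concentrations in the usage note are purely illustrative assumptions (the recording solutions are not described in this section), and the function name `nernst_mv` is hypothetical.

```python
import math

GAS_CONSTANT = 8.314462618   # J mol^-1 K^-1
FARADAY = 96485.33212        # C mol^-1

def nernst_mv(conc_out_mm, conc_in_mm, valence, temp_c):
    """Nernst equilibrium potential in mV for an ion of the given valence,
    from its extracellular and intracellular concentrations (same units)."""
    temp_k = temp_c + 273.15
    return 1000.0 * GAS_CONSTANT * temp_k / (valence * FARADAY) * math.log(
        conc_out_mm / conc_in_mm
    )
```

For example, `nernst_mv(140.0, 10.0, -1, 33.0)` gives a value close to −70 mV, so a roughly 14-fold chloride gradient at 33 °C would be consistent with the holding potential used here. These concentrations are hypothetical and are not taken from the paper.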

To isolate conductances that mediate SPN inhibition, the membrane 
potential was held at the reversal potential of ionotropic glutamate 
receptors (~0 mV). Under these conditions, DA neuron 
stimulation evoked large outward currents in all recorded dSPNs 
(n = 8) and iSPNs (n = 21) (Fig. 2a-c and Supplementary Fig. 3). 
Collectively, these currents had peak amplitudes of 617 ± 78 pA 
(n = 29; range = 0.11-1.93 nA), rise times of 2.0 ± 0.2 ms and decay 
time constants of 56 ± 4 ms. They reversed at ECl and were blocked by 
SR95531 and bicuculline, but not by the GABAC receptor antagonist 
TPMPA (Fig. 2a, b, g), indicating that they represent GABAA receptor-mediated 
inhibitory postsynaptic currents (IPSCs). Similar observations 
were made in mice expressing Cre under control of a tyrosine 
hydroxylase promoter (Supplementary Fig. 4). Consistent with previous 
reports, no current remained with both glutamate and GABAA 
receptors blocked. Thus, these data show that (1) activation of dopaminergic 
terminals rapidly activates ionotropic glutamate and GABA 
receptors in SPNs, (2) the resulting currents do not differentially affect 
dSPNs and iSPNs, and (3) GABAergic conductances mediate the net 
inhibitory effect of DA neurons on striatal output. 

DA neurons may activate GABAA receptors on SPNs either by 
recruiting a population of GABAergic interneurons—a mechanism 
akin to feed-forward inhibition—or by directly releasing GABA. 
Several lines of evidence suggest the latter. First, the latency between 
light and IPSC onset averaged 2.2 ± 0.1 ms (n = 29; Fig. 2d, e), which 
may be too short to accommodate two synaptic transmission steps at 
32-34 °C (ref. 16). Second, GABAergic conductances preceded or 
occurred synchronously with EPSCs (Fig. 2d, e), which arise after 
glutamate co-release from DA terminals (Fig. 2h, i). Third, a population 
of dopaminergic SNc neurons expresses messenger RNA for 
glutamic acid decarboxylase (GAD-65), indicating that they may 
synthesize GABA. We reasoned that if GABA originates from dopa- 
minergic terminals, optically evoked IPSCs, EPSCs and extracellular 
DA should be similarly affected by pharmacological agents and persist 
under conditions that prevent disynaptic transmission. Indeed, IPSCs 
in SPNs were reduced by quinpirole, eliminated by cadmium, but were 
insensitive to glutamate and DA receptor inhibitors (Fig. 2g), indicating 
that they require Ca2+-dependent release of a transmitter other 
than glutamate or DA. Moreover, light-evoked IPSCs, EPSCs and 
DA release were abolished in the presence of the voltage-gated Na+ 
channel blocker tetrodotoxin (TTX), showing that ChR2-mediated 
depolarization is not sufficient to trigger transmitter release from 
nigrostriatal axons (Fig. 2h, i). Neurotransmission can be rescued from 
directly illuminated ChR2-expressing terminals in the presence of TTX 
by providing extra depolarization with the voltage-gated K+ channel 
blocker 4-aminopyridine. Accordingly, IPSCs, EPSCs and DA release 
recovered upon co-application of TTX and 4-aminopyridine (Fig. 2h, i), 
indicating that GABA, glutamate and DA are directly released from 
dopaminergic terminals. 
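
The latency argument rests on three measurements per synaptic current: the time from light onset to current onset, to half-maximal amplitude and to peak (Fig. 2e). A minimal sketch of how such latencies could be read from a sampled trace follows. The onset criterion (first sample deviating from baseline by more than `n_sd` baseline standard deviations) is an assumption, since the detection rule is not given in this section, and the function name `psc_latencies` is hypothetical.

```python
import numpy as np

def psc_latencies(trace_pa, dt_ms, light_onset_ms, n_sd=3.0):
    """Latencies (ms, relative to light onset) to current onset, half-maximal
    amplitude and peak for one evoked postsynaptic current."""
    trace = np.asarray(trace_pa, dtype=float)
    i0 = int(round(light_onset_ms / dt_ms))
    if i0 < 1 or i0 >= trace.size:
        raise ValueError("light onset must fall inside the trace, after a baseline")
    baseline = trace[:i0]
    # Absolute deflection from baseline, so inward and outward currents
    # are handled the same way.
    deflection = np.abs(trace[i0:] - baseline.mean())
    above = deflection > n_sd * baseline.std()
    if not above.any():
        raise ValueError("no response detected above the baseline noise")
    onset_idx = int(np.argmax(above))                 # first supra-threshold sample
    peak_idx = int(np.argmax(deflection))             # largest deflection
    half_idx = int(np.argmax(deflection >= 0.5 * deflection[peak_idx]))
    return {
        "onset_ms": onset_idx * dt_ms,
        "half_max_ms": half_idx * dt_ms,
        "peak_ms": peak_idx * dt_ms,
    }
```

Applying the same function to the IPSC recorded at 0 mV and the EPSC recorded at −70 mV in the same cell gives the paired latencies needed to ask whether inhibition lags excitation.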

The vesicular GABA transporter VGAT (encoded by Slc32a1) is the 
only transporter known to package GABA into synaptic vesicles and 
is considered indispensable for inhibitory synaptic transmission. 
We generated conditional knockout (cKO) mice in which VGAT is 
specifically deleted from DA neurons (Slc6a3^IRES-Cre/wt;Slc32a1^lox/lox 
mice), predicting that ChR2-evoked IPSCs would be abolished. 
However, light-evoked IPSCs and EPSCs were unaffected in these mice 
(Fig. 3a, b, g, j), indicating that VGAT is not responsible for vesicular 
loading of GABA in DA neurons. We instead hypothesized that GAD- 
65 may synthesize GABA from its metabolic precursor glutamate once 
inside synaptic vesicles. If correct, preventing glutamate loading 
into synaptic vesicles by genetically ablating the vesicular glutamate 
transporter 2 (VGLUT2; encoded by Slc17a6) from DA neurons 
(Slc6a3^IRES-Cre/wt;Slc17a6^lox/lox mice) should eliminate IPSCs and 
EPSCs. However, conditional deletion of VGLUT2 abolished light-
evoked EPSCs in SPNs without affecting IPSC amplitude or latency 
(Fig. 3c, g, j), excluding this possibility. 

These results indicate that GABA release originates in nigrostriatal 
terminals but is independent of VGAT and VGLUT2, suggesting the 
existence of an alternative vesicular transporter with previously 
unidentified activity for GABA. Consistent with this, IPSCs were elimi- 
nated in slices obtained from mice treated with the VMAT2 antagonists 
reserpine, Ro4-1284 or TBZ and largely recovered upon Ro4-1284 and 
TBZ washout (Fig. 3d-g). The same manipulation did not affect EPSCs 
or VGAT-dependent GABA release from SPNs (Supplementary Fig. 5), 
and we confirmed that DA itself does not function as a GABAA receptor 
agonist in SPNs (Supplementary Fig. 6). Moreover, DA depletion with 
α-methyl-tyrosine (Fig. 1e) had no effect on IPSCs or EPSCs (Fig. 3g). 
We therefore conclude that VMAT2, but not DA, is required for the 
release of GABA by DA neurons. 

If GABAergic IPSCs depend on VMAT2 solely for GABA transport 
into synaptic vesicles, IPSCs should be restored in reserpine-treated 
mice by expressing VGAT in DA neurons (Fig. 3h). Accordingly, in 
Slc6a3^IRES-Cre/wt mice injected with AAV encoding Cre-dependent 
VGAT (AAV-DIO-VGAT) and treated with reserpine, optogenetic 
stimulation of nigrostriatal axons elicited large SR95531-sensitive 
IPSCs exhibiting synaptic latencies (Fig. 3g-j) and rise times (1.8 ± 0.3, 
n = 10; P > 0.05 versus control, Mann-Whitney rank sum test) indistinguishable 
from IPSCs observed in untreated mice. Together, these 
data indicate that presynaptic DA terminals contain GABA, the synaptic 
packaging of which requires VMAT2 but not VGAT. 

VMAT2 transports a variety of substrates, including catecholamines, 
serotonin and histamine. Although GABA does not bear structural 
resemblance to known VMAT2 substrates, our findings suggest 
that VMAT2 may function as a vesicular GABA transporter. To test 
Figure 3 | GABA release from DA neurons requires VMAT2 but not VGAT. 
a-c, Representative ChR2-evoked (1 ms, blue) IPSCs (red, Vh = 0 mV) and 
EPSCs (grey, Vh = −70 mV) from SPNs in Slc6a3^IRES-Cre/wt (control, 
a), Slc6a3^IRES-Cre/wt;Slc32a1^lox/lox (VGAT cKO, b) and 
Slc6a3^IRES-Cre/wt;Slc17a6^lox/lox (VGLUT2 cKO, c) mice. d-f, As in a for control 
Slc6a3^IRES-Cre/wt mice treated in vivo and in vitro with the VMAT2 antagonists 
reserpine (d), Ro4-1284 (e) or TBZ (f). g, Mean IPSC (red) and EPSC (grey) 
amplitudes across conditions (n = 4-33). Washout, slices obtained from mice 
treated in vivo with TBZ or Ro4-1284 and subsequently allowed to recover in 
vitro for more than 1 h in ACSF. *P < 0.05 versus ACSF; **P < 0.05 versus 
TBZ/Ro4-1284; ***P < 0.05 versus reserpine (Kruskal-Wallis analysis of 
variance). h, Schematic of working hypothesis: provided DA neurons contain 
cytosolic GABA, viral expression of reserpine-resistant VGAT in DA neurons 
should rescue GABA transport into synaptic vesicles (SVs) and IPSCs in 
reserpine-treated mice. Note that VGAT might also incorporate into VMAT2+ 
vesicles. i, Voltage-clamp recording (Vh = 0 mV, red; Vh = −70 mV, grey) 
from a reserpine-treated iSPN upon optogenetic stimulation (1 ms, blue) of 
VGAT-expressing DA axons. j, IPSC onset latencies (grey, individual cells; red, 
average) did not differ across conditions. res., reserpine. GABAergic nature of 
outward currents (Vh = 0 mV) in b, c and i was confirmed with SR95531 
(pink). Error bars, s.e.m. 


this possibility, we asked if VMAT2 can substitute for VGAT to sustain 
GABA release in a population of non-monoaminergic GABAergic 
neurons. Specifically, we attempted to restore GABA release in 
iSPNs devoid of VGAT by exogenously expressing VMAT2. We con- 
ditionally excised the gene encoding VGAT in iSPNs and virally 
expressed ChR2 in these cells to allow monitoring of GABA release 
from iSPN axon collaterals onto neighbouring dSPNs as light-evoked 
IPSCs (Fig. 4a). Whereas optogenetic stimulation of iSPNs in control 
mice (Adora2a-Cre;Slc32a1^lox/wt;Drd2-EGFP) reliably evoked large 
IPSCs in dSPNs, IPSCs were almost entirely abolished in dSPNs from 
VGAT cKO mice (Adora2a-Cre;Slc32a1^lox/lox;Drd2-EGFP) (Fig. 4b, c, e), 
confirming the dependence of vesicular GABA transport in SPNs 
Figure 4 | VMAT2 functions as a vesicular GABA transporter. 
a, Experimental setup: ChR2 was selectively expressed in Cre-containing iSPNs 
of mice with one (control; Adora2a-Cre;Slc32a1^lox/wt;Drd2-EGFP mice) or both 
alleles of the gene encoding VGAT flanked by lox sites (VGAT cKO; 
Adora2a-Cre;Slc32a1^lox/lox;Drd2-EGFP mice). VGAT cKO + VMAT2, an AAV 
encoding Cre-dependent VMAT2 (AAV-DIO-VMAT2), was co-injected with 
AAV-DIO-ChR2 in the striatum of VGAT cKO mice to rescue GABA release 
from iSPNs. b-d, Voltage-clamp recordings (Vh = 0 mV) of axon-collateral 
IPSCs in dSPNs evoked by optogenetic stimulation (1 ms, blue) of iSPNs in the 
absence (red) or presence (pink) of SR95531 in control (b), VGAT cKO (c) and 
VGAT cKO + VMAT2 (d) mice. Insets: iSPN presynaptic terminal schematic 
illustrating experimental conditions. Red triangles, GABA. e, Summary 
histogram (mean ± s.e.m.) of experiments in b-d (n = 10-15 dSPNs). 
*P < 0.05 versus control and VGAT cKO + VMAT2 (Kruskal-Wallis analysis 
of variance); **P < 0.05 versus IPSC without SR95531 (Mann-Whitney rank 
sum test). f, IPSC onset latencies from light presentation onset (grey, individual 
cells; red, mean ± s.e.m.) in control (Adora2a-Cre;Slc32a1^lox/wt;Drd2-EGFP 
mice) and VGAT cKO + VMAT2 mice (Adora2a-Cre;Slc32a1^lox/lox;Drd2-EGFP 
mice transduced with AAV-DIO-VMAT2 in striatum). 


on VGAT. By contrast, in VGAT cKO mice transduced with Cre-
dependent VMAT2 (AAV-DIO-VMAT2), light reliably evoked large, 
short-latency IPSCs (Fig. 4d-f). Thus, VMAT2 expression in GABAergic 
neurons can functionally replace VGAT and sustain inhibitory synap- 
tic transmission. 

Midbrain DA neurons are critical for the initiation, selection and 
reinforcement of motor actions and have been implicated in motor, 
cognitive and addictive disorders. Their effects are largely attributed 
to the slow neuromodulatory actions of DA receptors. Our studies 
show that DA neurons also exert a rapid and potent inhibitory influence 
on the activity of SPNs by releasing another transmitter that 
activates GABAA receptors. Release of this transmitter depends on 
VMAT2 activity, indicating that VMAT2 or a molecular complex 
requiring VMAT2 activity packages it in synaptic vesicles together 
with DA. The simplest model accounting for our data is that GABA 
is the transmitter packaged by VMAT2 and released onto SPNs. 
However, we cannot rule out that another molecule present in iSPNs 
and DA neurons, acting as a GABAA receptor agonist, and serving as a 
substrate for both VMAT2 and VGAT, fulfils that role. 

It is estimated that 5-10% of SNc DA neurons express GAD-65 and 
fewer than 1% express VGLUT2 in rodents. Therefore, distinct 
subpopulations of DA neurons may release GABA and glutamate, and 
reliable detection of IPSCs and EPSCs may result from innervation of 
SPNs by several DA neurons. Alternatively, GABAergic transmission 
may be common to midbrain DA neurons as any means of acquiring 
GABA, such as plasma membrane uptake, would result in vesicular 
transport. GAD expression varies in response to neuronal activity 
and damage, suggesting that GABAergic signalling from DA neurons 
may be altered by drug abuse. Our findings also suggest that loss of 
GABA release in the striatum may contribute to the symptoms of 
Parkinson’s disease as well as to the alterations of striatal circuitry that 
accompany loss of midbrain DA neurons. 

Because all monoaminergic neurons express VMAT2, GABA co- 
release may extend to adrenergic, noradrenergic, serotonergic, histami- 
nergic as well as other dopaminergic neurons. Indeed, GABA release was 
reported from dopaminergic amacrine cells in the retina” and periglo- 
merular cells in the olfactory bulb”, although dependence on VMAT2 
remains to be determined. Moreover, a considerable fraction (up to 80%) 
of monoaminergic neurons in locus coeruleus”, hypothalamus” and 
raphe nuclei” contain GABA or express GAD. Together, these findings 
expand the repertoire of synaptic mechanisms available to monoami- 
nergic cells and suggest that perturbations of GABA co-transmission 
might contribute to the aetiology of monoaminergic pathologies or to 
the therapeutic efficacy of VMAT2 antagonists. 


METHODS SUMMARY 


Stereotaxic injections of Cre-dependent AAVs were targeted to SNc of postnatal
day (P)18-25 Slc6a3^IRES-Cre;Drd2-EGFP mice, in which Cre and EGFP are
respectively restricted to cells expressing the plasma membrane DA transporter
DAT and D2 receptors. Conditional deletion of VGAT or VGLUT2 in these mice
was respectively achieved by breeding in floxed Slc32a1 or Slc17a6 alleles to
homozygosity. AAVs were allowed to express for at least 21 days before experimentation.

Whole-cell current- and voltage-clamp recordings were obtained from pathway- 
identified dorsal striatum SPNs in acute parasagittal slices of mature (P40-218) 
mice at 32-34°C using standard techniques. Constant-potential amperometry 
(+400 to 600 mV versus Ag/AgCl) was performed using carbon-fibre microelectrodes.
ChR2-expressing fibres were stimulated using brief full-field flashes of blue
laser light (1 ms; 473 nm; 6.5-10.0 mW mm⁻²) at intervals of 30 s or more. Traces
are the average of three to five consecutive acquisitions. For pharmacological
analyses, the peak amplitudes of three consecutive light-evoked responses 3-4 min after
drug perfusion onset were averaged, normalized to baseline and compared with
values obtained at corresponding times in control preparations bathed in artificial
cerebrospinal fluid (ACSF). Data in text and figures are reported as mean ± standard
error of the mean (s.e.m.). Statistical tests are noted in the text (significance:
P < 0.05).


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS


Mice. Knock-in mice bearing an internal ribosome entry site (IRES)-linked Cre
recombinase gene downstream of the gene Slc6a3, which encodes the plasma
membrane DA transporter DAT (referred to as Slc6a3^IRES-Cre mice), were
obtained from the Jackson Laboratory (stock number 006660). Homozygous
(Slc6a3^IRES-Cre/IRES-Cre) and heterozygous (Slc6a3^IRES-Cre/wt) animals were
bred with Drd2-EGFP transgenic mice (GENSAT, founder line S118), which
express EGFP under control of a bacterial artificial chromosome containing
the D2 receptor genomic locus to permit distinction between direct- and
indirect-pathway SPNs. Alternatively, Slc6a3^IRES-Cre mice were crossed with mice
bearing a Cre-dependent TdTomato reporter transgene (Ai14; the Jackson
Laboratory, stock number 007914; referred to as Rosa26^lsl-tdTomato mice) to show
the distribution of Cre+ cells. Mice in which the second exon of Slc32a1, which
encodes the vesicular GABA transporter (VGAT), or Slc17a6, which encodes 
the vesicular glutamate transporter 2 (VGLUT2), is flanked by lox sites 
(Slc32a1^fl/fl and Slc17a6^fl/fl, respectively) were provided by B. Lowell.
Conditional deletion of VGAT or VGLUT2 from Cre-expressing DA neurons
was achieved by crossing Slc6a3^IRES-Cre;Slc32a1^fl/fl and Slc32a1^fl/fl;Drd2-EGFP
mice, or Slc6a3^IRES-Cre;Slc17a6^fl/fl and Slc17a6^fl/fl mice, respectively.
No differences were observed between control Slc6a3^IRES-Cre mice and
Slc6a3^IRES-Cre mice carrying a single floxed allele of Slc32a1 or Slc17a6, so data
from these animals were pooled. iSPNs were genetically targeted using Adora2a- 
Cre bacterial artificial chromosome transgenic mice (GENSAT, founder line 
KG139), which express Cre under transcriptional control of the adenosine A2A
receptor genomic promoter. Conditional deletion of VGAT in Cre-expressing
iSPNs was achieved by crossing Adora2a-Cre;Slc32a1^fl/fl and Slc32a1^fl/fl;Drd2-EGFP
mice. Other lines used included Th-Cre transgenic mice (the
Jackson Laboratory, stock number 008601) and Drd1a-Cre bacterial artificial
chromosome transgenic mice (GENSAT, founder line EY262). With the exception
of Slc6a3^IRES-Cre and Rosa26^lsl-tdTomato mice, which were maintained on a
C57BL/6 background, all other strains were maintained on a mixed background of 
C57BL/6 and FVB. All experimental manipulations were performed in accordance 
with protocols approved by the Harvard Standing Committee on Animal Care 
following guidelines described in the US National Institutes of Health Guide for the 
Care and Use of Laboratory Animals. 

Virus preparation. Conditional expression of the light-gated non-selective cation 
channel channelrhodopsin-2 (ChR2, H134R variant) in Cre-containing neurons 
was achieved using a recombinant AAV encoding a double-floxed inverted open 
reading frame (DIO) of the ChR2-mCherry fusion protein under transcriptional 
control of the EF1α promoter (AAV-DIO-ChR2;
http://www.stanford.edu/group/dlab/optogenetics/sequence_info.html#dio). Cre-dependent AAV vectors
encoding EGFP (AAV-DIO-EGFP), VGAT (NM_009508.2; AAV-DIO-VGAT; 
Addgene plasmid 39320) or VMAT2 (NM_172523.3; AAV-DIO-VMAT2; 
Addgene plasmid 39339) were generated by replacing the ChR2-mCherry coding 
sequence using AscI and NheI restriction sites (Genscript). These viral vectors
were subsequently packaged (serotype 8) by a commercial vector core facility 
(University of North Carolina). All AAVs were stored in undiluted aliquots at a 
working concentration of at least 10'7 genomic copies per millilitre at — 80 °C until 
intracranial injection. 

Stereotaxic intracranial injections. Male and female mice (P18-25) were anaesthetized
with isoflurane and placed in a small animal stereotaxic frame (David
Kopf Instruments). After exposing the skull under aseptic conditions, a small burr
hole was drilled and AAVs were injected (0.5-1 µl total volume) unilaterally
through a pulled glass pipette at a rate of 100 nl min⁻¹ using a UMP3 microsyringe
pump (World Precision Instruments). Injection coordinates were 0.8 mm anterior
from Lambda, 1.3 mm lateral and 4.4 mm below pia for SNc, and 0.5 mm anterior
from Bregma, 1.75 mm lateral and 2.7 mm below pia for dorsal striatum. After
surgical procedures, mice were returned to their home cage for at least 21 days to allow
for maximal gene expression. To identify striatum-projecting neurons in ventral
midbrain, Slc6a3^IRES-Cre;Rosa26^lsl-tdTomato/wt mice were injected with 0.1-0.2 µl
fluorescent latex microspheres (Green Retrobeads, Lumafluor) in dorsal striatum
(same coordinates as above) and allowed to recover in their home cage for 7 days
before processing their brain for tyrosine hydroxylase immunolabelling.

Immunocytochemistry. Mice were deeply anaesthetized with isoflurane and 
perfused transcardially with 4% paraformaldehyde in 0.1 M sodium phosphate 
buffer. Brains were post-fixed for 1-3 days, washed in phosphate buffered saline 
(PBS) and sectioned (40-50 µm) coronally (Vibratome). Free-floating sections
were permeabilized/blocked with 5% normal goat serum in PBS with 0.3%
Triton X-100 (PBST) for 2 h at room temperature and incubated with primary
antibodies at 4 °C overnight and with secondary antibodies for 2 h at room temperature
in PBST supplemented with 1% normal goat serum. Brain sections were
mounted on superfrost slides, dried and coverslipped with ProLong antifade
reagent with DAPI (Molecular Probes). Primary antibodies used included rabbit
anti-tyrosine-hydroxylase (1:2,000; AB152, Millipore), mouse anti-tyrosine-
hydroxylase (1:1,000; 22941, ImmunoStar) and rabbit anti-VMAT2 (1:2,000; 
ab81855, Abcam). Alexa Fluor 488- and 647-conjugated secondary antibodies 
to rabbit and mouse (Invitrogen) were diluted 1:1,000. Endogenous TdTomato 
and EGFP fluorescences were not immuno-enhanced. Whole sections were 
imaged with an Olympus VS110 slide scanning microscope. High-resolution 
images of regions of interest were subsequently acquired with a Zeiss LSM 510 
META confocal microscope (Harvard NeuroDiscovery Center). Images represent 
maximum intensity projections of 3-7 µm confocal stacks.

Slice preparation. Acute brain slices were obtained from 40- to 218-day-old
mice (median = 74 days) using standard techniques. Mice were anaesthetized
by isoflurane inhalation and perfused transcardially with ice-cold ACSF containing
(in mM) 125 NaCl, 2.5 KCl, 25 NaHCO3, 2 CaCl2, 1 MgCl2, 1.25 NaH2PO4 and 11
glucose (295 mOsm kg⁻¹). Cerebral hemispheres were removed, placed in cold
choline-based cutting solution (consisting of (in mM) 110 choline chloride, 25
NaHCO3, 2.5 KCl, 7 MgCl2, 0.5 CaCl2, 1.25 NaH2PO4, 25 glucose, 11.6 ascorbic
acid and 3.1 pyruvic acid), blocked and transferred into a slicing chamber containing
ice-cold choline-based cutting solution. Parasagittal slices of striatum
(275 µm thick) were cut with a Leica VT1000S vibratome, transferred for
10-20 min to a holding chamber containing ACSF at 34 °C and subsequently
maintained at room temperature (20-22 °C) until use. All recordings were
obtained within 4 h of slicing. Both cutting solution and ACSF were constantly
bubbled with 95% O2/5% CO2.

Electrophysiology. Individual slices were transferred to a recording chamber
mounted on an upright microscope (Olympus BX51 WI) and continuously superfused
(2-3 ml min⁻¹) with ACSF warmed to 32-34 °C by passing it through a
feedback-controlled in-line heater (SH-27B; Warner Instruments). Cells
were visualized through a ×40 water-immersion objective with either infrared
differential interference contrast optics or epifluorescence to identify EGFP+
iSPNs and striatal regions showing the highest density of ChR2-mCherry+ axonal
arbors. Epifluorescence was used sparingly to minimize ChR2 activation before
recording. Whole-cell voltage- and current-clamp recordings were made from
dSPNs or iSPNs in anterior dorsolateral and dorsomedial striatum within
300 µm of the callosal-striatal border. dSPNs and iSPNs were identified on the
basis of the respective absence or presence of EGFP fluorescence and their firing
properties. Patch pipettes (2-4 MΩ) pulled from borosilicate glass (G150F-3,
Warner Instruments) were filled either with a Cs+-based low-Cl− internal solution
containing (in mM) 135 CsMeSO3, 10 HEPES, 1 EGTA, 3.3 QX-314 (Cl− salt), 4
Mg-ATP, 0.3 Na-GTP, 8 Na2-phosphocreatine (pH 7.3 adjusted with CsOH; 295
mOsm kg⁻¹) for voltage-clamp recordings, or with a K+-based low-Cl− internal
solution composed of (in mM) 135 KMeSO3, 3 KCl, 10 HEPES, 1 EGTA, 0.1
CaCl2, 4 Mg-ATP, 0.3 Na-GTP, 8 Na2-phosphocreatine (pH 7.3 adjusted with
KOH; 295 mOsm kg⁻¹) for current-clamp recordings. Bath solutions for whole-cell
recordings did not contain drugs unless specified otherwise. For all voltage-clamp
experiments, errors due to the voltage drop across the series resistance
(<20 MΩ) were left uncompensated. Membrane potentials were corrected for a
~8 mV liquid junction potential. To activate ChR2-expressing fibres, light from a
473 nm laser (Optoengine) was focused on the back aperture of the microscope
objective to produce wide-field illumination of the recorded cell. Brief pulses of
light (1 ms duration; 6.5-10.0 mW mm⁻² under the objective) were delivered at
the recording site at 30 s intervals under control of the acquisition software. For
current-clamp recordings, depolarizing current steps evoking 10-20 Hz trains of
action potentials were applied at regular intervals (10-15 s) either alone or in
combination with a 1 ms flash of blue light.

Amperometric recordings. Constant-potential amperometry was performed
using homemade glass-encased carbon-fibre microelectrodes (7 µm diameter,
50-100 µm length) placed approximately 50 µm within dorsal striatum slices and
held at a constant voltage of +400 to 600 mV versus Ag/AgCl by a Multiclamp 700B
amplifier (Molecular Devices). Electrodes were calibrated with fresh 5 µM dopamine
standards in ACSF using fast-scanning cyclic voltammetry (from −0.5 V to
0.9 V, and back to −0.5 V at a rate of 230 mV ms⁻¹ every 100 ms, with the electrode
held at 0 V between scans) to determine the optimal oxidation potential, followed
by constant-potential amperometry of dopamine flow-in to allow conversion of
current amplitude to extracellular dopamine concentration. Dopaminergic terminals
surrounding the electrode were stimulated with 1 ms flashes of blue laser
light (6.5-10.0 mW mm⁻²) delivered at 2-3 min intervals.

Reagents. Drugs (all from Tocris, unless specified otherwise) were applied by bath
perfusion: SR95531 (10 µM), (−)bicuculline (20 µM),
(1,2,5,6-tetrahydropyridin-4-yl)methylphosphinic acid (TPMPA; 20 µM),
2,3-dihydroxy-6-nitro-7-sulfamoyl-benzo(f)quinoxaline (NBQX; 10 µM),
R,S-3-(2-carboxypiperazin-4-yl)propyl-1-phosphonic acid (CPP; 10 µM),
tetrodotoxin (TTX; 1 µM), 4-aminopyridine
(4AP; 0.1-1 mM), CdCl2 (Sigma, 30 µM) and (−)quinpirole (10 µM). The cocktail
of antagonists used broadly to target D1 and D2 receptor families (D1/2R
antagonists) consisted of SCH23390 (1 µM), SKF83566 (1 µM), (−)sulpiride
(10 µM) and L-741,626 (1 µM). To inhibit monoaminergic vesicular transport
and deplete transmitter-filled vesicles, Slc6a3^IRES-Cre mice were injected
intraperitoneally with either the irreversible VMAT inhibitor reserpine (5 mg kg⁻¹) 24 h
before slicing, the reversible VMAT antagonist Ro4-1284 (Sigma, 15 mg kg⁻¹) 1 h
before slicing or the competitive and selective VMAT2 antagonist tetrabenazine
(TBZ; 5 mg kg⁻¹) 2 h before slicing. To deplete presynaptic terminals of dopamine,
Slc6a3^IRES-Cre mice were administered the tyrosine hydroxylase antagonist
α-methyl-DL-tyrosine methyl ester hydrochloride (Sigma, 250 mg kg⁻¹
intraperitoneally) 3 h and 1 h before slicing. Brain sections from these animals were
prepared as described above, but were recovered and incubated in ACSF containing
1 µM reserpine, 10 µM Ro4-1284, 50 µM TBZ or 30 µM α-methyl-tyrosine,
respectively. Half of the slices obtained from Ro4-1284- and TBZ-treated mice
were kept at least 1 h in regular ACSF before recording to allow for drug washout
and resumption of neurotransmitter transport into synaptic vesicles (washout
condition in Figs 1e and 3g).

Data acquisition and analysis. Membrane currents and potentials were amplified
and low-pass filtered at 3 kHz using a Multiclamp 700B amplifier (Molecular
Devices), digitized at 10 kHz and acquired using National Instruments acquisition
boards and a custom version of ScanImage written in MATLAB (Mathworks).
Amperometry, electrophysiology and imaging data were analysed offline using Igor
Pro (Wavemetrics) and ImageJ (National Institutes of Health). In figures, amperometry
and voltage-clamp traces represent the averaged waveform of three to five
consecutive acquisitions. Detection threshold for IPSCs and EPSCs was set at 10 pA.
Averaged waveforms were used to obtain current latency, peak amplitude, 10-90%
rise time and decay time. Current onset was measured using a threshold set at
three standard deviations of baseline noise. Peak amplitudes were calculated by
averaging over a 2 ms window around the peak. For pharmacological analyses in
Figs 1e and 2f, g, i, the peak amplitudes of three consecutive light-evoked responses
3-4 min after drug perfusion onset were averaged, normalized to baseline averages
and compared statistically with values obtained at corresponding times in control
preparations bathed in ACSF. Data (reported in text and figures as mean ± s.e.m.)
were compared statistically using the following: Mann-Whitney rank sum test,
Kruskal-Wallis analysis of variance (ANOVA) with Dunn’s multiple comparison
test, and two-way ANOVA followed by Bonferroni post-hoc tests, as indicated in
the text. P values less than 0.05 were considered statistically significant. 
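
The waveform measurements described above (onset at three standard deviations of baseline noise, peak averaged over a 2 ms window, 10-90% rise time) can be sketched in a few lines. This is a minimal illustration and not the authors' Igor Pro analysis: the function name, its arguments, the positive-going response and the synthetic trace are all assumptions made for the example.

```python
import math
from statistics import mean, stdev

def analyse_waveform(trace_pa, sample_rate_hz, stim_index, n_baseline):
    """Measure onset latency, peak amplitude and 10-90% rise time.

    trace_pa: averaged current waveform in pA (positive-going response assumed).
    stim_index: sample at which the light flash starts.
    n_baseline: number of pre-stimulus samples used to estimate the noise.
    Raises StopIteration if no sample crosses the onset threshold.
    """
    dt_ms = 1000.0 / sample_rate_hz
    baseline = trace_pa[stim_index - n_baseline:stim_index]
    offset, noise_sd = mean(baseline), stdev(baseline)
    signal = [v - offset for v in trace_pa]
    post = range(stim_index, len(signal))

    # Onset: first post-stimulus sample above three SDs of baseline noise.
    onset = next(i for i in post if signal[i] > 3.0 * noise_sd)

    # Peak: average over a 2 ms window centred on the largest sample.
    peak_index = max(post, key=lambda i: signal[i])
    half_window = round(1.0 / dt_ms)
    window = signal[max(peak_index - half_window, 0):peak_index + half_window + 1]
    peak = mean(window)

    # Rise time: interval between first crossings of 10% and 90% of the peak.
    rising = range(onset, peak_index + 1)
    i10 = next(i for i in rising if signal[i] >= 0.1 * peak)
    i90 = next(i for i in rising if signal[i] >= 0.9 * peak)

    return {
        "latency_ms": (onset - stim_index) * dt_ms,
        "peak_pa": peak,
        "rise_10_90_ms": (i90 - i10) * dt_ms,
    }

# Synthetic 10 kHz trace (hypothetical): flash at sample 100, response 2 ms later.
noise = [0.5 if i % 2 else -0.5 for i in range(120)]
rise = [100.0 * (k + 1) / 20 for k in range(20)]            # 2 ms linear rise
decay = [100.0 * math.exp(-k / 300.0) for k in range(1, 600)]
result = analyse_waveform(noise + rise + decay, 10_000, stim_index=100, n_baseline=100)
```

Because the peak is averaged over a window, the reported amplitude is lower than the single largest sample whenever the response rises or decays within that window.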
FTO genotype is associated with phenotypic
variability of body mass index 


A list of authors and their affiliations appears at the end of the paper. 


There is evidence across several species for genetic control of 
phenotypic variation of complex traits’*, such that the variance 
among phenotypes is genotype dependent. Understanding genetic 
control of variability is important in evolutionary biology, agricul- 
tural selection programmes and human medicine, yet for complex 
traits, no individual genetic variants associated with variance, as 
opposed to the mean, have been identified. Here we perform a 
meta-analysis of genome-wide association studies of phenotypic 
variation using ~170,000 samples on height and body mass index 
(BMI) in human populations. We report evidence that the single 
nucleotide polymorphism (SNP) rs7202116 at the FTO gene locus, 
which is known to be associated with obesity (as measured by mean 
BMI for each rs7202116 genotype)’, is also associated with 
phenotypic variability. We show that the results are not due to 
scale effects or other artefacts, and find no other experiment-wise 
significant evidence for effects on variability, either at loci other 
than FTO for BMI or at any locus for height. The difference in 
variance for BMI among individuals with opposite homozygous 
genotypes at the FTO locus is approximately 7%, corresponding to 
a difference of ~0.5 kilograms in the standard deviation of weight. 
Our results indicate that genetic variants can be discovered that are 
associated with variability, and that between-person variability in 
obesity can partly be explained by the genotype at the FTO locus. 
The results are consistent with reported FTO by environment 
interactions for BMI*, possibly mediated by DNA methylation”. 
Our BMI results for other SNPs and our height results for all SNPs 
suggest that most genetic variants, including those that influence 
mean height or mean BMI, are not associated with phenotypic 
variance, or that their effects on variability are too small to detect 
even with sample sizes greater than 100,000.

Genetic studies of complex traits usually focus on quantifying and 
dissecting phenotypic variation within populations, by contrasting 
mean differences in phenotypes between genotypes. For example, in 
association studies the difference between the average phenotype (P) of 
each genotype is tested. In addition, the phenotypic variance among 
individuals of the same genotype (G) can vary across genotypes, so that 
phenotypic variance conditional on genotype, var(P|G), is not constant.
Phenotypic variance given a particular genotype does not need to
be due to sensitivity to external environmental factors but can, for
example, be caused by developmental fluctuation of the internal
micro-environment in a genotype-dependent manner, such as through
genetic control of stochastic variation in development or in homeostatic
control. The difference between genotypes can also depend on
external factors, for example, on the environment in which they are 
reared, in which case there is a genotype by environment (G × E)
interaction. In species in which the same genotype can be measured 
across defined environments, such as in plant or animal populations, 
the difference in mean phenotype for each genotype can be quantified 
experimentally, and is known as the reaction norm of the genotype!”. 
However, any environment is likely to be heterogeneous, so that the 
environment experienced by each individual differs, although these 
differences are not formally recognized by the experimenter. In this 
situation, if a G × E interaction exists it may manifest as differences in
environmental sensitivity so that genotypes differ in phenotypic variance.
Therefore, even if the environments, internal or external, are not
directly measured, evidence for genetic control of variation can be 
quantified through an analysis of variability. 
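
A simple linear model makes the last step explicit. It is offered only as an illustration: the symbols μ_G, β_G, E and ε are assumptions introduced here, not notation from the text. Suppose the phenotype of an individual with genotype G depends on an unmeasured environmental exposure E, independent of G, through a genotype-specific sensitivity β_G:

```latex
P = \mu_G + \beta_G E + \varepsilon,
\qquad \operatorname{var}(E) = \sigma_E^2,
\quad \operatorname{var}(\varepsilon) = \sigma_\varepsilon^2,
\quad \operatorname{cov}(E, \varepsilon) = 0
\\[4pt]
\operatorname{var}(P \mid G) = \beta_G^2 \, \sigma_E^2 + \sigma_\varepsilon^2
```

Under these assumptions, genotypes that differ in sensitivity β_G differ in phenotypic variance even though E is never observed, which is why a test on variances can reveal the interaction.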

There is empirical evidence for genetic control of phenotypic vari- 
ation in several species’, including Drosophila’, snails*, maize'* and 
chickens’, and specific quantitative trait loci with an effect on variance 
have been reported for yeast* and Arabidopsis*. Many theories and 
methods to identify genetic loci responsible for phenotypic variability 
have been proposed’'®'*. In humans, there have been reports that 
variability of serum cholesterol and triglyceride levels within mono- 
zygotic twin pairs depends on their genotype at the MN blood group 
system”. In clinical practice, knowledge of phenotypic variability as a 
function of genotype may be important when the phenotypes are risk 
factors for disease or treatment response, in particular when there are 
no mean differences between genotypes in the population”. 

Detection of genetic variation in environmental or phenotypic variance
requires large sample sizes because, relative to its expected
value, the variance has a larger sampling error than the mean.
We performed a meta-analysis of genome-wide association studies 
(GWAS) of phenotypic variation for height and BMI in human popu- 
lations on approximately 170,000 samples comprising 133,154 in a 
discovery set and 36,727 for in silico replication, and report a single 
locus with a genome-wide significant effect on variability in BMI. 
Height and BMI were chosen because genetic effects on variability in 
height and size traits have been reported in other species, and because 
very large samples of genotyped and phenotyped individuals are avail- 
able through existing research consortia. 

We performed a discovery meta-analysis of 38 studies consisting of 
133,154 individuals (60% females) of recent European descent to
identify SNPs that are associated with the variability of height or
BMI. In each study, ~2.44 million genotyped and imputed autosomal
SNPs were included in the analysis after applying quality-control 
filters. We adjusted height and BMI phenotypes for possible covariates 
such as age, sex and case-control status, and standardized them to z 
scores by an inverse-normal transformation. We then regressed the 
squared z scores (z²), which are a measure of variance, on the genotype
indicator variable of each SNP to test for association of the SNP
with trait variability. The association statistics were corrected by the 
genomic control method” in individual studies and then combined by 
an inverse-variance meta-analysis across all of the studies (see 
Methods). We selected 42 SNPs at 6 loci for height and 51 SNPs at 7 
loci for BMI with P < 5 × 10⁻⁶ for in silico replication (Supplementary
Fig. 1). We examined the top two SNPs at each of the 6 loci for height 
and 7 loci for BMI in a further sample of 36,727 individuals (54% 
females) of European ancestry from 13 studies (Methods). For BMI, 
only rs7202116 at the FTO locus (Fig. 1) and rs7151545 at the RCOR1
locus (Supplementary Fig. 2) were replicated at genome-wide signifi- 
cance level, with P = 2.9 X 10 * and P = 3.6 X 10 ° in the validation 
set and P=2.4X10 7° and P=4.1 X10 ® in the combined set, 
respectively (Table 1). None of the height SNPs was replicated 
(Table 1). We show by an approximate conditional analysis using 
summary statistics from the discovery meta-analysis and estimated 
linkage disequilibrium structure from the Atherosclerosis Risk In
Communities (ARIC) cohort that there is no secondary associated
SNP in the FTO region when conditioning on rs7202116 (Supplementary
Fig. 3). The estimate of the effect associated with rs7202116 on
BMI z² was slightly larger in men (0.041, standard error (SE) = 0.009)
than in women (0.033, SE = 0.007) in the combined set but the difference
was not significant (P = 0.670). The RCOR1 SNP only just passed
the genome-wide significance level (5 × 10⁻⁸); however, it did not
reach the experiment-wise significance level (2.5 × 10⁻⁸) considering
that two independent traits were tested. There were several case-control
studies included in the meta-analysis that were ascertained
for diseases that may be correlated with BMI. We performed a further
meta-analysis in the combined set excluding these case-control studies,
and the FTO SNP rs7202116 remained genome-wide significant with
P=2.8X10 "* but the RCOR1 SNP did not with P= 3.6 x 10 °
(Supplementary Table 1). We therefore focus on the FTO locus in
the main text and provide the results for the RCOR1 locus in the
Supplementary Information.

Figure 1 | Test statistics (−log10(P values)) for association with BMI
variability in the discovery meta-analysis of SNPs at the FTO locus against
their physical location. The SNPs surrounding rs7202116 are colour-coded to
reflect their linkage disequilibrium with rs7202116. The recombination rates
are plotted in cyan to reflect local linkage disequilibrium structure. Genes, the
position of exons and the direction of transcription from the University of
California, Santa Cruz (UCSC) genome browser are noted. The P value for
rs7202116 in the combined set is represented by a purple diamond, and that
from the discovery set by a purple circle.

On the scale on which BMI is measured, the predicted per-allele
effect of the G allele (the other allele is A) of rs7202116 on the mean
difference is 0.37 kg m⁻² in men and 0.43 kg m⁻² in women, and the
effect on the variance difference is 0.79 kg² m⁻⁴ in men and 1.09 kg²
m⁻⁴ in women, reflecting the larger standard deviation of BMI in
women compared with men (Supplementary Table 2). Assuming an
additive model, the mean difference between the GG and AA genotypes
is 0.74 kg m⁻² in men and 0.86 kg m⁻² in women, with a variance
difference between the two genotypes of 1.58 kg² m⁻⁴ in men and
2.18 kg² m⁻⁴ in women, which is 7.2% of the phenotypic variance of
BMI in both men and women. To provide an illustration of the effect of
rs7202116 on BMI variance, we did an approximate calculation of its
effect on the variance of weight. If we take the mean height of 1.78 m
for men and 1.65 m for women, the difference in the variance of weight
between the two genotype groups is roughly 16 kg² in both men and
women (Supplementary Table 2). For example, if the standard deviation
(SD) of weight is 15 kg for men, the predicted SD of weight in the
two homozygous genotype classes is 14.73 and 15.27 kg, respectively. 
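
The arithmetic behind these figures can be reproduced with a short sketch. Two assumptions are made here that the text does not spell out: height is treated as fixed at its mean, so that weight = BMI × height², and the two genotype-specific SDs are obtained from a first-order approximation centred on the overall SD. The function names are hypothetical.

```python
def weight_variance_difference(bmi_variance_diff, mean_height_m):
    """Convert a BMI variance difference (kg^2 m^-4) into a weight variance
    difference (kg^2), treating height as fixed at its mean."""
    return bmi_variance_diff * mean_height_m ** 4

def homozygote_sds(overall_sd, variance_diff):
    """Split the SD between the two homozygous classes to first order,
    using d(SD) ~ d(variance) / (2 * SD) around the overall SD."""
    sd_diff = variance_diff / (2.0 * overall_sd)
    return overall_sd - sd_diff / 2.0, overall_sd + sd_diff / 2.0

men = weight_variance_difference(1.58, 1.78)     # about 15.9 kg^2
women = weight_variance_difference(2.18, 1.65)   # about 16.2 kg^2
low_sd, high_sd = homozygote_sds(15.0, 16.0)     # about 14.73 and 15.27 kg
```

The gap between the two SDs, about 0.5 kg, matches the difference in the standard deviation of weight quoted in the summary.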

The effect of a SNP on variance could be due to our use of the z²
value as a measure of variance or to a general relationship between
mean and variance of BMI. Below we present evidence that excludes
these two explanations. 
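
For readers who want to see the z² approach in concrete form, the sketch below regresses squared inverse-normal z scores on allele count within each study and combines the per-study slopes by inverse-variance weighting. It is an illustration under stated assumptions, not the authors' pipeline: the data are simulated, covariate adjustment and genomic control are omitted, and every function name and parameter value is hypothetical.

```python
import math
import random
from statistics import NormalDist, mean

def inverse_normal_transform(values):
    """Rank-based inverse-normal transformation to z scores."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    normal = NormalDist()
    z = [0.0] * n
    for rank, i in enumerate(order, start=1):
        z[i] = normal.inv_cdf((rank - 0.5) / n)
    return z

def regress(x, y):
    """Ordinary least-squares slope of y on x and its standard error."""
    n = len(x)
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sxx
    intercept = my - slope * mx
    rss = sum((b - intercept - slope * a) ** 2 for a, b in zip(x, y))
    return slope, math.sqrt(rss / (n - 2) / sxx)

def variance_association(genotypes, phenotypes):
    """Regress squared z scores on allele count (0, 1 or 2)."""
    z = inverse_normal_transform(phenotypes)
    return regress(genotypes, [v * v for v in z])

def inverse_variance_meta(estimates):
    """Combine (slope, se) pairs using weights 1 / se^2."""
    weights = [1.0 / se ** 2 for _, se in estimates]
    total = sum(weights)
    beta = sum(w * b for w, (b, _) in zip(weights, estimates)) / total
    return beta, math.sqrt(1.0 / total)

def simulate_study(n, allele_freq, sd_per_allele, rng):
    """Hypothetical data in which each coded allele raises the phenotypic SD."""
    genotypes = [(rng.random() < allele_freq) + (rng.random() < allele_freq)
                 for _ in range(n)]
    phenotypes = [rng.gauss(0.0, 1.0 + sd_per_allele * g) for g in genotypes]
    return genotypes, phenotypes

rng = random.Random(2012)
studies = [simulate_study(n, 0.4, 0.1, rng) for n in (20_000, 15_000, 10_000)]
per_study = [variance_association(g, p) for g, p in studies]
beta, se = inverse_variance_meta(per_study)
```

The simulated variance effect is far larger than the one reported for FTO, chosen so that the slope is clearly visible at these small sample sizes; detecting an effect of about 0.035 needs sample sizes on the order of those in the meta-analysis.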


Table 1 | Associations of the top 6 and 7 loci with variance of height and BMI, respectively 


                                                        Discovery
Chr.  SNP         bp           Nearest gene  CA  Freq.  β       SE      P           n
Height
1     rs6429820   14,210,915   PRDM2         G   0.196  −0.035  0.0071  1.0 × 10⁻⁶  129,200
2     rs6429975   143,002,110  KYNU          T   0.180  −0.036  0.0074  1.0 × 10⁻⁶  129,196
2     rs6748377   45,002,877   SIX3          C   0.175  −0.038  0.0075  4.0 × 10⁻⁷  129,183
7     rs10486722  41,778,433   INHBA         C   0.339  0.029   0.0060  1.0 × 10⁻⁶  128,834
8     rs1026852   3,577,500    CSMD1         G   0.444  −0.029  0.0059  1.0 × 10⁻⁶  126,363
14    rs12891343  34,453,301   BAZ1A         aL  0.227  0.031   0.0068  5.0 × 10⁻⁶  128,725
BMI
2     rs12328474  140,638,570  LRP1B         G   0.263  −0.038  0.0078  1.2 × 10⁻⁶  104,640
2     rs10932241  208,685,200  CRYGD         C   0.407  0.028   0.0059  2.9 × 10⁻⁶  127,597
4     rs11942401  188,052,244  FAT           A   0.140  −0.043  0.0085  4.3 × 10⁻⁷  125,010
6     rs1418304   82,795,837   IBTK          G   0.496  −0.026  0.0057  3.3 × 10⁻⁶  127,611
14    rs12894649  102,232,512  RCOR1         C   0.057  0.061   0.0126  1.3 × 10⁻⁶  127,080
14    rs7151545   102,247,397  RCOR1         G   0.057  0.059   0.0126  2.4 × 10⁻⁶  127,080
16    rs7193144   52,368,187   FTO           C   0.403  0.030   0.0058  1.9 × 10⁻⁷  127,537
16    rs7202116   52,379,116   FTO           G   0.402  0.035   0.0067  2.0 × 10⁻⁷  95,966
18    rs620052    37,900,962   PIK3C3        G   0.378  0.033   0.0069  1.6 × 10⁻⁶  95,971

The squared z scores (z²) were used to test for association of the top 6 and 7 SNPs with trait variability (height and BMI, respectively). The discovery set consists of 133,154 individuals, and data for in silico
replication are from another 36,727 samples. At both the FTO and RCOR1 loci, the second top SNPs (highlighted in bold) in the discovery set pass the single trait genome-wide significance level (5 × 10⁻⁸) in the
combined set. β, estimate of additive effect on z²; bp, physical position; CA, coded allele; chr., chromosome; freq., frequency of the coded allele.
If an SNP has an effect on the mean, the test statistic for association
of the SNP with z² will be inflated, and the non-centrality parameter
(NCP) of the χ² test under the null hypothesis of no effect on variance
is: np(1 − p)(1 − 2p)²(a + (1 − 2p)d)⁴, in which n is the sample size, p
is the frequency of the coded allele, and a and d are the additive and
dominance effects, respectively, on the mean difference (Supplementary
Note). We show by analysis and simulation results based on an
additive and dominance genetic model that such inflation is inversely 
proportional to the minor allele frequency (MAF) of the SNP; that is, 
SNPs with a lower MAF will tend to have higher test statistics under 
the null hypothesis (Supplementary Fig. 4). However, when we plotted 
the observed test statistics of the confirmed 180 height loci** and 32 
BMI loci” that have the largest reported effects on the mean, we did 
not observe such a trend (Supplementary Fig. 5). We calculated the 
NCP of the known height and BMI loci given the effects on the mean
from the published papers, and the NCP values of all these known
loci were smaller than 1 (results not shown). The observed genomic 
inflation factor in the discovery meta-analysis was 1.039 for height and 
1.033 for BMI (Supplementary Fig. 6). This small inflation could be 
due to many SNPs affecting the mean and therefore having a tiny effect 
on z² (Supplementary Fig. 7), or many SNPs that have an effect on the
variance that is too small to be significant even with our large sample 
size. Across common SNPs in the genome, variants at the FTO locus 
have the largest effect size on BMI’’. The G allele of the FTO SNP 
rs7202116 has a population frequency of ~0.4 and an additive effect 
on the mean BMI of ~0.1 z-score units*”’. If our significant result at 
the FTO locus is due only to an allelic effect on mean BMI, we would 
expect an allelic effect on variability of ~0.002 (predicted from the 
equation in the Supplementary Note), which is very small compared 
with the observed effect of 0.036. For some traits, the variance changes 
in a predictable manner as the mean changes. In this case, a scale 
transformation, such as a logarithmic transformation, can remove 
effects on the variance when they are simply due to an effect on the 
mean’. We were interested in effects of SNP on variability that would 
remain after a scale transformation, and therefore sought to exclude 
scale effects that could explain our observed association. We per- 
formed further analyses in three data sets each with approximately 
20,000 individuals with individual-level genotype and phenotype data 
available to verify the effects of rs7202116 at the FTO locus on BMI 
variance (Methods and Table 2). We used several tests, including 
Bartlett’s test statistic, to test for the difference in variance between 
the three genotypes. The Bartlett’s test P value was <0.05 in each of the 
three data sets, regardless of whether or not the BMI phenotypes were 
adjusted for the mean difference, logarithm transformed or inverse- 
normal transformed (Table 2). In the combined analysis of the three 


Table 1 | Continued 




data sets totalling 60,624 individuals, the effect of rs7202116 on the 
BMI z² score after adjusting for the mean difference was 0.030 
(P = 1.2 × 10⁻⁴) for inverse-normal transformed BMI, 0.065 
(P = 2.3 × 10⁻¹²) for logarithm-transformed BMI, and 0.097 
(P = 8.9 × 10⁻¹⁶) for BMI without scale transformation (Table 2). The 
decrease of the effect of rs7202116 on BMI z² owing to the adjustment 
for the mean difference was ~0.003, in line with that of ~0.002 as 
predicted from the theory above. Similar conclusions as above can 
be drawn from the further analyses for rs7151545 at the RCOR1 locus 
(Supplementary Table 3). We plotted the test statistics and estimates 
for the effects on the variability in our discovery meta-analysis against 
those for the effects on the mean from the published GIANT 
meta-analyses for height and BMI, and did not find any apparent 
correlations except for a few outlying SNPs at the FTO locus (Supplementary 
Fig. 7). These results together suggest that the observed effect of the 
FTO SNP on variability is neither a consequence of the effect on the 
mean nor due to the choice of scale, and that our inverse-normal 
transformation is likely to be overly conservative. Results from reported 
quantile regression of untransformed BMI on a multiple-SNP predictor 
of BMI and on FTO are consistent with our results but are also 
consistent with scale effects due to the skewed distribution of untrans- 
formed BMI. We have shown in this study that the effect of FTO on 
variability is not due to a scale effect and, concordantly, a quantile 
regression of both transformed and untransformed BMI z-scores on 
the SNPs at the FTO and RCOR1 loci in 17,974 individuals 
shows a relationship between effect size and the quantile of the distri- 
bution (Supplementary Fig. 8). By contrast, the use of untransformed 
BMI induces widespread correlation between estimated SNP effects on 
the mean and on variance (Supplementary Fig. 9). 
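
The regression of z² on genotype used to detect variance effects can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the analysis pipeline used in this study: the function names (`simulate`, `variance_effect`, `ols_slope`), the allele frequency and the per-genotype standard deviations are hypothetical, and the phenotype is standardized with the sample standard deviation rather than by an inverse-normal transformation. The sketch simulates a SNP whose genotype groups differ in variance, removes the genotype means, and regresses the squared z score on allele dosage.

```python
import random
import statistics


def ols_slope(x, y):
    """Slope and its standard error for the simple regression y = a + b*x + e."""
    n = len(x)
    mean_x = statistics.fmean(x)
    mean_y = statistics.fmean(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    rss = sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))
    se = (rss / (n - 2) / sxx) ** 0.5
    return slope, se


def simulate(n, freq, sd_by_genotype, seed=0):
    """Draw allele dosages (0, 1 or 2) and phenotypes with genotype-specific s.d.

    All parameter values passed in are hypothetical; the genotype means are zero.
    """
    rng = random.Random(seed)
    dosage, phenotype = [], []
    for _ in range(n):
        g = int(rng.random() < freq) + int(rng.random() < freq)
        dosage.append(g)
        phenotype.append(rng.gauss(0.0, sd_by_genotype[g]))
    return dosage, phenotype


def variance_effect(dosage, phenotype):
    """Regress z^2 on dosage after adjusting for the genotype means."""
    groups = {}
    for g, y in zip(dosage, phenotype):
        groups.setdefault(g, []).append(y)
    group_mean = {g: statistics.fmean(ys) for g, ys in groups.items()}
    # After removing the genotype means the overall mean is zero.
    adjusted = [y - group_mean[g] for g, y in zip(dosage, phenotype)]
    sd = statistics.pstdev(adjusted)
    z_squared = [(y / sd) ** 2 for y in adjusted]
    return ols_slope(dosage, z_squared)


if __name__ == "__main__":
    d, y = simulate(20000, 0.4, [0.9, 1.0, 1.1], seed=1)
    beta, se = variance_effect(d, y)
    print(f"effect of one allele on z^2: {beta:.3f} (SE {se:.3f})")
```

Because the genotype means are removed before squaring, a non-zero slope in this sketch reflects a difference in variance between genotypes rather than a difference in mean.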

We have reported a meta-analysis of GWAS of squared normalized 
residuals for two quantitative traits in human populations, and provide 
empirical evidence that the FTO and RCOR1 loci influence phenotypic 
variance of obesity. Conversely, we did not observe any 
SNPs for height, or any SNPs other than those at the FTO 
and RCOR1 loci for BMI, that were associated with phenotypic variance 
at genome-wide significance (Table 1), even for those loci known to have 
effects on the mean (Supplementary Fig. 5), which indicates that SNP 
effects on variance are uncommon for height and BMI, and those 
previously identified SNP effects on the mean, although very small, 
are robust to environmental perturbation. We provide evidence that 
the association between the FTO locus and BMI variability is not due to 
artefacts such as scale or ascertainment. We also argue that it is 
implausible that the observed effect of the FTO SNP on variance is 
due to its strong linkage disequilibrium (D' = 1) with a causal variant 
that has a large effect on the mean (Supplementary Note). The FTO 


In silico replication Combined 

Freq. β SE P n β SE P n 

0.209 −0.002 0.0131 8.9 × 10⁻¹ 32,355 −0.027 0.0062 1.0 × 10⁻⁵ 161,555 
0.177 −0.002 0.0137 8.9 × 10⁻¹ 32,472 −0.028 0.0065 1.0 × 10⁻⁵ 161,668 
0.185 −0.006 0.0138 6.7 × 10⁻¹ 31,988 −0.031 0.0066 3.0 × 10⁻⁶ 161,171 
0.318 −0.005 0.0112 6.3 × 10⁻¹ 32,416 0.021 0.0053 6.0 × 10⁻⁵ 161,250 
0.435 −0.004 0.0110 7.4 × 10⁻¹ 31,837 −0.023 0.0052 7.0 × 10⁻⁶ 158,200 
0.225 0.012 0.0120 3.2 × 10⁻¹ 36,150 0.027 0.0059 6.0 × 10⁻⁶ 164,875 
0.250 0.035 0.0152 2.0 × 10⁻² 32,403 −0.023 0.0069 1.1 × 10⁻³ 137,043 
0.411 −0.006 0.0125 6.2 × 10⁻¹ 28,641 0.022 0.0053 5.6 × 10⁻⁵ 156,238 
0.128 0.003 0.0187 8.5 × 10⁻¹ 28,016 −0.035 0.0077 6.2 × 10⁻⁶ 153,026 
0.493 0.004 0.0103 6.9 × 10⁻¹ 36,721 −0.019 0.0050 1.2 × 10⁻⁴ 164,332 
0.050 0.058 0.0248 1.9 × 10⁻² 32,298 0.060 0.0112 7.9 × 10⁻⁸ 159,378 
0.053 0.083 0.0285 3.6 × 10⁻³ 28,040 0.063 0.0115 4.1 × 10⁻⁸ 155,120 
0.406 0.020 0.0115 8.0 × 10⁻² 32,449 0.028 0.0052 5.4 × 10⁻⁸ 159,986 
0.417 0.039 0.0107 2.9 × 10⁻⁴ 35,267 0.036 0.0057 2.4 × 10⁻¹⁰ 131,233 
0.382 −0.010 0.0111 3.7 × 10⁻¹ 34,668 0.021 0.0059 3.5 × 10⁻⁴ 130,639 
Table 2 | Effects of the FTO SNP rs7202116 on BMI 


BMI log(BMI) BMI (inv. norm.) 
Unadj. Adj. Unadj. Adj Unadj. Adj 
WGHS (n = 22,888) 
β 0.148 0.142 0.100 0.093 0.046 0.040 
SE 0.021 0.020 0.015 0.015 0.013 0.013 
P 4.5 x107}8 4.0 x10!” 5.5 x10713 8.6 x 10°19 68 x10+ 3.3x10% 
Permutation P <1 x10~* <1 x10~4 <1 x10 <1 x10~4 9.0 x 10-4 3.9 x 10-3 
Bartlett’s P 1.1 x 10°24 1.1 x 10°24 2.0 x 10713 2.0 x 1013 6.5 x10 6.6 x 10-3 
Mean AA —0.070 0.0 —0.069 0.0 —0.068 0.0 
Mean AG —0.001 0.0 —0.001 0.0 0.0 0.0 
Mean GG 0.161 0.0 0.159 0.0 0.152 0.0 
Variance AA 0.895 0.900 0.932 0.937 0.971 0.977 
Variance AG 1.002 1.008 0.995 1.001 0.990 0.996 
Variance GG 1.194 1.202 1.132 1.138 1.060 1.066 
EPIC (n= 19,762) 
β 0.077 0.076 0.049 0.048 0.027 0.026 
SE 0.021 0.021 0.017 0.017 0.014 0.014 
P 1.7 x 10+ 2.1 x 104 3.2 x 10% 3.9 x 10% 6.1 x 10-2 7.1 x 10°? 
Permutation P <1 x10+ <1x 10-7 49x10? 5.1 x 10°3 64 x 10°? 7.1 x 10°? 
Bartlett’s P 7.6 x 1077 7.6 x 1077 3.0 x 10-3 3.0 x 1073 1.2 x 107! 1.2 x 107! 
Mean AA —0.077 0.000 —0.076 0.000 —0.075 0.000 
Mean AG 0.012 0.000 0.012 0.000 0.012 0.000 
Mean GG 0.103 0.000 0.102 0.000 0.100 0.000 
Variance AA 0.932 0.936 0.951 0.955 0.967 0.970 
Variance AG 1.005 1.009 1.007 1.011 1.010 1.013 
Variance GG 1.085 1.089 1.045 1.049 1.013 1.017 
ARIC + QIMR + NHS + HPFS (n= 17,974) 
β 0.070 0.067 0.049 0.046 0.026 0.024 
SE 0.022 0.022 0.017 0.017 0.015 0.015 
P 1.7 x 10% 2.8 x 10-3 3.6 x 103 6.1 x 10°3 8.9 x 10-2 1.2 x 1071 
Permutation P 1.6 x10% 2.6 x 10°3 3.8 x 10% 7.1 x 10-3 8.7 x10? 1.2 x 1071 
Bartlett’s P 1.2 x 1077 L25e10-% 2.5 x 10-4 2.5 x 1074 2.0 x 10-2 2.0 x 1072 
Mean AA —0.067 0.0 —0.068 0.0 —0.069 0.0 
Mean AG 0.006 0.0 0.008 0.0 0.010 0.0 
Mean GG 0.122 0.0 0.118 0.0 0.113 0.0 
Variance AA 0.968 0.973 0.978 0.983 0.994 0.998 
Variance AG 0.968 0.972 0.974 0.978 0.975 0.979 
Variance GG 1.131 1.136 1.093 1.097 1.059 1.064 
Combined (n = 60,624) 
β 0.100 0.097 0.068 0.065 0.034 0.030 
SE 0.012 0.012 0.009 0.009 0.008 0.008 
P 8.9 x10°1” 8.9 x10 16 14 x10738 2.3 x 10 1% 24x 10° 1.2 x 10-4 
Bartlett’s P 1.3 x 10-3 1.3 x 10-3 85x10 15 8.6 x10 1° 44 x10%4 42x10%4 
Mean AA —0.071 0.0 —0.071 0.0 —0.070 0.0 
Mean AG 0.005 0.0 0.006 0.0 0.007 0.0 
Mean GG 0.129 0.0 0.127 0.0 0.122 0.0 
Variance AA 0.93 0.93 0.95 0.96 0.98 0.98 
Variance AG 0.99 1.00 0.99 1.00 0.99 1.00 
Variance GG 1.14 1.14 1.09 1.09 1.04 1.05 


The effects of the FTO SNP rs7202116 on the variance for BMI and log(BMI) were tested in three subsets of data. The BMI phenotypes were corrected for age effect and standardized to z scores using the mean and 
standard deviation, or by an inverse-normal (inv. norm.) transformation in each gender group in each cohort. Phenotypes were adjusted (adj.) (or unadjusted (unadj.)) for mean difference in the three genotypes. 
For the EPIC cohort, 2,397 samples were in the meta-analysis, and 17,365 were not part of the meta-analysis. For the combined ARIC, QIMR, NHS and HPFS cohort, 12,741 samples were in the meta-analysis and 
5,233 samples were not. β, the effect of the G allele on z²; Bartlett’s P, P value calculated from Bartlett’s test for variance difference in the three genotypes; EPIC, European Prospective Investigation into Cancer; 
HPFS, Health Professionals Follow-up Study; NHS, Nurses’ Health Study; permutation P, empirical P value calculated from 10,000 permutations; QIMR, Queensland Institute of Medical Research; WGHS, 
Women’s Genome Health Study. 


SNPs that are associated with variance are also associated with mean 
differences in BMI. Interestingly, this phenomenon seems to be 
restricted to the FTO gene and to obesity, because we did not observe 
such effects for height or for BMI at loci other than FTO. One possible 
explanation of the observation is a differential response to physical 
activity, because interactions between FTO genotypes and physical 
activity have been reported for the same SNPs as we report in this 
study: the G allele that is associated with an increase in mean BMI has a 
smaller effect in the group of people with a high level of physical 
activity than in the absence of physical activity. There may be 
other unknown lifestyle factors, including diet, that also interact with 
the FTO genotype and result in the observed effect on variability. 
We do not provide a mechanism of how alleles at FTO influence 
variability (how FTO alleles affect the mean is also not known). 
However, the fact that the allele that increases obesity also increases 
variability suggests a breakdown of homeostatic control. Data on mice 
lacking the Fto gene suggest that the observed effects on mean obesity 
in humans may be due to upregulation or dysregulation of FTO 
expression, resulting in an increased susceptibility to obesity. If both 
upregulation and impairment of FTO expression have a role then this 
could provide a mechanism of the observed effect on variability. The 
FTO protein affects demethylation of nuclear RNA in vitro, but 
whether the efficiency of this process depends on the FTO genotype 
or how this may be related to the observed effects on BMI is not clear. 
Notably, a recent study reported that rs7202116 allele G, which is 
present on the obesity-susceptibility haplotype at the FTO locus, 
creates a CpG site along with other variants in perfect linkage 
disequilibrium with it, and therefore risk alleles have increased DNA 
methylation. In addition, it was reported that a CpG site in the first 
intron of FTO showed significant hypomethylation in type 2 diabetes 
cases relative to controls, and that the risk variant seems to have an 
effect on methylation status at other genes. DNA methylation can be 
affected by environmental influences, including dietary and lifestyle 
factors, and may affect gene expression. For example, physical exercise 
may increase gene expression at the FTO locus, but less so in GG 
individuals compared with AA individuals because their alleles are 
more methylated. This therefore suggests a possible mechanism for 
the observed effects on both the mean and variability. However, more 
research is needed to determine the molecular effect and mechanism of 
FTO on both the levels and variability of obesity. 

Overall, our findings are consistent with a low heritability of 
phenotypic variability and no common genetic variants that account for a 
large proportion of variation in environmental or phenotypic variability. 
They also indicate an absence of widespread genotype-by-environment 
interaction effects, at least for height and obesity in humans and with 
interaction effects large enough to be detected in our study in which 
specific environmental factors were not identified. Nevertheless, the 
demonstration that individual genetic loci with effects on variability 
can be identified with sufficiently large sample sizes facilitates further 
study to understand the function and evolution of the genetic control of 
variation. 


METHODS SUMMARY 


We performed a meta-analysis of 51 GWAS with 169,881 individuals of European 
ancestry, and ~2.44 million genotyped or imputed SNPs after quality control. In 
each study, association analysis of each SNP with height and BMI z² was 
performed after the phenotypes had been adjusted for covariates and 
inverse-normal transformed. We meta-analysed the association results of each SNP from 38 studies 
with 133,154 individuals as a discovery set, and validated the top SNPs identified in 
the discovery set with association Pvalues <5 10° in a separate sample of 
36,727 individuals from 13 studies. Further analyses using individual-level geno- 
type and phenotype data to test for difference in variance of BMI between the three 
groups for the top SNPs at the FTO and RCOR1 loci were performed on 60,624 
individuals, including 22,598 individuals who were not part of the meta-analysis. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


Fifty-one studies were included in the meta-analysis. All individuals were of recent 
European descent. In each of the participating studies, genotyped SNPs that passed 
standard quality-control processes (missingness, Hardy-Weinberg equilibrium 
test and MAF) were used to impute the ungenotyped SNPs to the HapMap II 
CEU reference panel. We excluded SNPs with imputation quality score <0.4 
for IMPUTE and <0.3 otherwise. A summary of sample size, genotyping 
platform, quality-control filters and the imputation tool of all the participating 
studies is provided in Supplementary Table 4. We further excluded SNPs with 
MAF < 0.01 in each study or in the meta-analysis, and retained about 2.68 million 
autosomal SNPs in the analysis. 

In each study, height and BMI phenotypes were adjusted for age and standardized 
to z score by an inverse-normal transformation. The analysis protocol supplied to all 
cohorts is given as a Supplementary Note. The descriptive statistics of phenotypes of 
each study are shown in Supplementary Table 5. The association analyses of pheno- 
typic variability were performed on a single-SNP basis by the following additive 
genetic model: y = α + βx + e, in which y is z², α is the intercept, β is the additive 
SNP effect on z², x is the allelic dosage coded as 0, 1 or 2 for the three genotype groups, 
and e is the residual. We stratified the analysis by gender group and/or case-control 
status where applicable. We selected 38 studies consisting of 133,154 individuals as 
the discovery set by the time when data were available. We collected summary-level 
association results of all the SNPs from these studies and adjusted the standard errors 
of all SNPs by the genomic control approach in each study, that is, multiplying the 
standard errors of the estimates of β by the square root of the genomic inflation 
factor. We then combined the effect of each SNP by an inverse-variance 
meta-analysis implemented in METAL. In a regression analysis, the squared standard 
error of the estimate of a SNP effect is σ²/(2p(1 − p)n), in which σ² is the residual 
variance, p is the frequency of the coded allele, and n is the sample size. This assumes 
Hardy-Weinberg equilibrium of genotype frequencies. If the effect size is small, σ² is 
approximately equal to the variance of y, which is 2. We checked the overall quality of 
each study by plotting the median of 1/SE across all SNPs against the reported sample 
size, and by plotting the median of 2p(1 − p)nSE² across all SNPs to see if it was close 
to 2 (Supplementary Fig. 10). We further estimated the effective sample size of each 
SNP by n̂ = 2/(2p(1 − p)SE²), using the summary statistics of the whole discovery 
set, and excluded SNPs with n̂ < mean(n̂) − 2 SD(n̂) and retained ~2.44 million 
SNPs for both height and BMI. We collected data from a further 36,727 samples 
from 13 cohorts (Supplementary Tables 4 and 5), and validated the top SNPs at 6 
associated loci for height and 7 for BMI (P<5 X 10. °) in these extra samples. 
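
The quality-control quantities described above can be sketched in a few lines of Python. This is a minimal illustration, not the METAL implementation or the scripts used in this study: the function names are hypothetical, the variance of y is fixed at 2 (the variance of a squared standard normal deviate), and the use of the sample standard deviation in the exclusion rule is an assumption.

```python
import statistics


def gc_adjust_se(se, inflation_factor):
    """Genomic control: multiply a standard error by the square root of lambda."""
    return se * inflation_factor ** 0.5


def inverse_variance_meta(betas, ses):
    """Fixed-effect inverse-variance combination of per-study estimates."""
    weights = [1.0 / se ** 2 for se in ses]
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    return beta, (1.0 / total) ** 0.5


def effective_n(p, se, var_y=2.0):
    """Effective sample size implied by SE^2 = var_y / (2p(1 - p)n)."""
    return var_y / (2.0 * p * (1.0 - p) * se ** 2)


def passes_n_filter(freqs, ses):
    """Flag SNPs whose effective n is not below mean - 2 SD across SNPs."""
    n_hat = [effective_n(p, se) for p, se in zip(freqs, ses)]
    # Assumption: sample (n - 1) standard deviation.
    cutoff = statistics.fmean(n_hat) - 2.0 * statistics.stdev(n_hat)
    return [n >= cutoff for n in n_hat]
```

For example, a SNP with allele frequency 0.5 and SE² = 0.004 has an effective sample size of 2/(0.5 × 0.004) = 1,000 under these assumptions.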

We performed further analyses in three data sets with a total sample size of 60,624 
with individual-level genotype and phenotype data to verify our findings. These 
three data sets include 22,888 individuals from the WGHS cohort, and 19,762 
individuals from the EPIC cohorts, and a combined sample of 17,974 individuals 
from the ARIC, QIMR, NHS and HPFS cohorts, with 17,365 individuals from the 
EPIC cohort and 5,233 individuals from the NHS and HPFS cohorts not part of the 
meta-analysis. We used logarithm or inverse-normal transformation to remove a 
possible mean-variance relationship of BMI phenotypes, and adjusted the pheno- 
Controlling interneuron activity in Caenorhabditis 
elegans to evoke chemotactic behaviour 


Askin Kocabas*, Ching-Han Shen*, Zengcai V. Guo & Sharad Ramanathan 


Animals locate and track chemoattractive gradients in the environ- 
ment to find food. With its small nervous system, Caenorhabditis 
elegans is a good model system in which to understand how the 
dynamics of neural activity control this search behaviour. Extensive 
work on the nematode has identified the neurons that are necessary 
for the different locomotory behaviours underlying chemotaxis 
through the use of laser ablation, activity recording in immobilized 
animals and the study of mutants. However, we do not know the 
neural activity patterns in C. elegans that are sufficient to control its 
complex chemotactic behaviour. To understand how the activity in 
its interneurons coordinates different motor programs to lead the 
animal to food, here we used optogenetics and new optical tools to 
manipulate neural activity directly in freely moving animals to 
evoke chemotactic behaviour. By deducing the classes of activity 
patterns triggered during chemotaxis and exciting individual 
neurons with these patterns, we identified interneurons that control 
the essential locomotory programs for this behaviour. Notably, 
we discovered that controlling the dynamics of activity in just 
one interneuron pair (AIY) was sufficient to force the animal to 
locate, turn towards and track virtual light gradients. Two distinct 
activity patterns triggered in AIY as the animal moved through the 
gradient controlled reversals and gradual turns to drive chemotactic 
behaviour. Because AIY neurons are post-synaptic to most 
chemosensory and thermosensory neurons, it is probable that these 
activity patterns in AIY have an important role in controlling and 
coordinating different taxis behaviours of the animal. 

Organisms, from bacteria to multicellular eukaryotes, have to search 
for food to survive. Complex internal circuits process external signals 
to evoke and coordinate multiple motor programs, leading the animal 
to track attractive odours and find food. Whether there are master 
nodes in the circuits that control and coordinate search behaviour, 
and whether the neural circuits generating chemotactic behaviour in 
C. elegans can be controlled through such key nodes are important 
questions. 

The nematode C. elegans uses reversals (backward movement) and 
sharp and gradual turns to locate and track gradients of chemoattractive 
Figure 1 | Asymmetric component of the odour signal controls gradual 
turning. a, The trajectory of the nose tip of a nematode overlaid on a modelled 
exponential profile (decay length, 1 cm) of the gradient of chemoattractants 
(pseudocolour) from the bacterial lawn. Plus signs denote the position of the 
animal at the indicated times. a.u., arbitrary units. b, Illustrations of an animal 
crawling perpendicular to (top) and along (bottom) the odour gradient. h, 
head-bending angle; D, dorsal (h > 0); V, ventral (h < 0). c, Odour signal at 
nose tip, I(t), versus head-bending angle, h(t), over the last (black) and 
sequentially earlier (grey) head swings for the nose-tip trajectory in a. d, Plot of 


the mean (black line) and s.d. (coloured bar) of the asymmetric (A) and 
symmetric (S) components of I(t) in c. |A| denotes the magnitude of A(t). 

e, Dorsal asymmetric odour stimulation (see Methods). f, g, Sample trajectories 
of the centre of mass of the animals upon dorsal and ventral asymmetric (f) and 
symmetric (g) odour stimulation. Angles define the turning angles with respect 
to the initial orientation of the animal. Grey bar denotes the mean and error 
bars denote s.d. over n = 10 experiments. B, back; F, front. **P < 0.05, two- 
sample t-test. 


1FAS Center for Systems Biology, Harvard University, Cambridge, Massachusetts 02138, USA. 2Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts 02138, USA. 
3Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn, Virginia 20147, USA. 4Allen Institute for Brain Science, Seattle, Washington 98103, USA. 5Harvard Stem Cell Institute, Harvard 
University, Cambridge, Massachusetts 02138, USA. 6School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, USA. 


*These authors contributed equally to this work. 


11 OCTOBER 2012 | VOL 490 | NATURE | 273 


©2012 Macmillan Publishers Limited. All rights reserved 




signals. Previous work on C. elegans has identified around 14 pairs 
of interneurons and motor neurons, including the interneuron pairs 
AIY, AIZ and AIB that are necessary for the locomotory behaviours 
underlying chemotaxis (Supplementary Table 1). The neuroanatomy 
of the animal shows that most of the amphid chemosensory and 
thermosensory neurons synapse onto one or more neurons of the first 
layer of the interneuron pairs AIY, AIZ and AIB, which are further 
connected to a dense network of interneurons. The activity dynamics 
in this network must process sensory signals to produce and coordinate 
the different locomotory behaviours underlying chemotaxis through 
the downstream motor neurons. Despite the experiments in the 
literature involving ablation, genetics and calcium imaging, we do 
not know whether chemotaxis is driven by key interneurons or 
whether the generation of this complex behaviour is achieved by the 
dynamics of a more diffuse neural network. 
To evoke chemotactic behaviour by directly controlling interneuron 
activity, we have to answer the intricately linked questions of which 
sets of interneurons to control, and which activity patterns to stimulate 
in them. To deduce the classes of activity patterns triggered in the 
nervous system during chemotaxis, we followed animals as they 
crawled towards a bacterial lawn (Fig. 1a). The undulatory head swings 
(from dorsal to ventral, as the animal crawls on its side) caused the 
angle at which the head bends relative to the locomotory direction, h(t) 
(Fig. 1b), to oscillate between positive and negative values. Owing to 
this changing head-bending angle and the movement of the animal, 
sensory cilia at the nose tip experienced the spatial profile of the 
chemoattractants (Fig. 1a, b) as a temporally fluctuating odour signal, 
I(h,t). In general, this signal can be written as a sum of two terms, 
I(h,t) = S(h,t) + A(h,t), in which S is a symmetric function of h: 
S(h,t) = S(−h,t) and A is an asymmetric function of h: 
Figure 2 | Asymmetric and symmetric excitation of AIY control gradual 
turning and reversal frequency. a, Set-up for closed-loop single-neuron 
stimulation (see Methods). DLP, digital light processing; EMCCD, electron- 
multiplying charge-coupled device; LED, light-emitting diode. b, Sample 
trajectories of the centre of mass of the animals after dorsal or ventral 
asymmetric excitation of AIY (n = 10; ttx-3::ChR2). c, Fluorescence image 
overlaid on the bright-field image of a nematode co-expressing ChR2 and mKO 
in AIY, AIZ and RME neurons (ser-2prom2::mKO and ser-2prom2::ChR2). 

d, Asymmetric dorsal stimulation of the animal in c. On dorsal head bending 
(top left), ChR2 and mKO were excited in AIY using the set-up in a (top right, 
higher fluorescence in AIY) but not on ventral head bending (bottom right, 
decreased fluorescence in AIY). Left, ×1 dark-field images of the animal; right, 
×15 fluorescent images of the neurons in the same animal. e, Sample 
trajectories of the nose tip of the animals on dorsal (left) or ventral (right) 
asymmetric excitation of AIY (n = 10; ser-2prom2::ChR2). f, Turning angle of 
animals after asymmetric stimulation of AIY::ChR2 (n = 10), AIY::Arch 
(n = 10) and AIB::Arch (n = 7). ATR, all-trans retinal (co-factor required for 
functional light-gated channels ChR2 and Arch). ‘Dorsal’ indicates asymmetric 
dorsal stimulation, ‘ventral’ indicates asymmetric ventral stimulation, ‘no light’ 
indicates unstimulated neurons, and ‘no ATR’ indicates control experiments. 
g, Reversal frequencies upon symmetric stimulation of AIY::ChR2 (n = 10), 
AIY::Arch (n = 19), AIB::ChR2 (n = 14) and AIB::Arch (n = 11). White and 
blue bars in f and g denote the mean, error bars denote 1 s.d. over n animals. 
**P < 0.05, two-sample t-test. 
A(h,t) = −A(−h,t). When the animal moves perpendicular to the 
gradient direction, I is dominated by A (Fig. 1b-d, top). As the animal 
turns and tracks the gradient, the magnitude of A, |A|, decreases and I 
is dominated by S (Fig. 1b-d, bottom and Supplementary Fig. 1a). 
To determine how these asymmetric and symmetric components of 
I(t) control locomotory behaviour, we built a microscopy system that 
delivers odours on a freely crawling animal in precise temporal 
patterns determined by h(t) (Supplementary Fig. 1b). To mimic A, 
we exposed the animal to asymmetric odour stimulation: air with 
chemoattractant vapours (1 X 10 °M isoamyl alcohol) blown on 
the entire animal when the head was bent in one direction (for 
example, dorsally, h >0) and odour-free air blown when the head 
was bent in the other direction (ventrally, h < 0) (Fig. 1e). The animal 
turned gradually in the direction in which its head was bent when the 
odour was delivered (Fig. 1f, Supplementary Fig. 1c, d and Supplementary 
Movie 1). To mimic S, we delivered vapours of isoamyl alcohol 
constantly, independent of h. The animal reduced its reversal 
frequency and did not turn (Fig. 1g). The most parsimonious 
hypothesis based on these results is that asymmetric and symmetric odour 
components generate activity patterns in the nervous system with 
corresponding symmetries to control turning and reversal frequency 
separately during chemotaxis. 
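
The decomposition of the odour signal into symmetric and asymmetric parts can be illustrated with a short Python sketch. For a fixed time t, the two parts follow directly from the definitions S(h) = S(−h) and A(h) = −A(−h), which give S(h) = [I(h) + I(−h)]/2 and A(h) = [I(h) − I(−h)]/2. The toy signal below is an assumption made for illustration only: it takes a locally linear gradient and places the nose tip at a hypothetical distance `head_length` from the pivot of the head, with `phi` the angle between the crawling direction and the gradient. None of these names or values is taken from the experiments.

```python
import math


def decompose(signal, h):
    """Split signal(h) into symmetric and asymmetric parts with respect to h."""
    plus, minus = signal(h), signal(-h)
    return 0.5 * (plus + minus), 0.5 * (plus - minus)


def toy_odour_signal(phi, baseline=1.0, slope=0.1, head_length=1.0):
    """Toy I(h) in a locally linear gradient (all parameters hypothetical)."""
    def signal(h):
        # Nose displacement along and perpendicular to the crawling direction.
        along = head_length * math.cos(h)
        across = head_length * math.sin(h)
        return baseline + slope * (math.cos(phi) * along + math.sin(phi) * across)
    return signal


def asymmetric_amplitude(signal, h_max=math.radians(30), steps=61):
    """Largest |A| over one head swing from -h_max to +h_max."""
    angles = [-h_max + 2.0 * h_max * i / (steps - 1) for i in range(steps)]
    return max(abs(decompose(signal, h)[1]) for h in angles)


if __name__ == "__main__":
    for label, phi in (("perpendicular", math.pi / 2), ("along", 0.0)):
        amplitude = asymmetric_amplitude(toy_odour_signal(phi))
        print(f"{label} to gradient: max |A| = {amplitude:.3f}")
```

In this toy model the asymmetric amplitude is largest when the animal crawls perpendicular to the gradient and vanishes when it crawls along the gradient, which mirrors the behaviour of |A| described above.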

We therefore identified interneurons that triggered the different 
locomotory behaviours necessary for chemotaxis by directly stimulat- 
ing individual neurons in a freely moving animal in asymmetric and 
symmetric patterns. To do so, we expressed channelrhodopsin-2 
(ChR2) or archaerhodopsin-3 (Arch) in different neurons. 
Light activation of ChR2 (by 480 nm light) and Arch (540 nm) leads 
to neural excitation and inhibition, respectively. 

Methods for targeted illumination of specific neurons in motionless animals 
and of body segments in freely moving animals have been 
developed to excite neurons for which specific promoters are not 
known. Because the neurons in the nerve ring are as close as 
5-10 μm to each other and their relative positions change quickly as 
Figure 3 | Asymmetric AIY excitation modulates the head-bending angle to 
cause turning. a, Representative head-bending angle (horizontal lines denote 
means of maximum and minimum h during each full head swing) of an 
AIY::ChR2 (ttx-3::ChR2) animal forced to first turn ventrally and then dorsally 
(positive angles, dorsal; negative angles, ventral). b, Histogram of head-bending 
angle bias (max(h) + min(h)) during dorsal and ventral turn caused by 
asymmetric AIY excitation. Error bars denote s.d. over n = 5. c, d, Sample 
tracks of animals on dorsal (left) or ventral (right) asymmetric excitation of 
AIZ::ChR2 (ser-2prom2::ChR2) (c), and SMB::ChR2 (odr-2(18)::ChR2) 


(d) animals. e, Turning angle upon asymmetric stimulation with odour 

(n = 10) and optically of AWCON::Arch (n = 10), AWCON::ChR2 (n = 9), 
AIY::ChR2 (n = 10), AIZ::ChR2 (n = 5), SMB::ChR2 (n = 5) and RME::ChR2 
(n = 5). ‘Dorsal’ indicates asymmetric dorsal stimulation, ‘ventral’ indicates 
asymmetric ventral stimulation and ‘no light’ indicates unstimulated neurons. 
Error bars denote s.d. over n. **P < 0.05, two-sample t-test. f, The mono- and 
polysynapses between AWCON, AIY, AIB, AIZ, SMB and RME (thickness of 
arrows is proportional to synapse number). Triangle denotes sensory neuron, 
hexagons denote interneuron and circles denote motor neurons. 
the animal moves (Supplementary Fig. 2a), we could not use these 
techniques. To stimulate one of many neurons optically (each with a 
diameter of 5-10 μm) expressing light-gated ion channels in the nerve 
ring of an animal that typically moves at 150 ± 50 μm s⁻¹ (Supplementary 
Fig. 2b), our set-up tracks, identifies and specifically illuminates 
the neuron(s) of interest, all within 25 ms, to achieve a 5-μm spatial 
resolution of excitation (Fig. 2a). 

Using this set-up, we first tested how stimulating the interneurons 
AIY and AIB affected locomotory behaviour. Both neuron pairs 
receive chemical synapses from the AWC sensory neurons that detect 
isoamyl alcohol and showed calcium activity when animals were 
stimulated with this chemoattractant. Asymmetric excitation of 
AIY with light in animals that expressed ChR2 only in AIY (under 
the promoter ttx-3) caused the animal to turn in the direction in which 
the head was bent when AIY were excited (Fig. 2b and Supplementary 
Movie 2). We validated our set-up by reproducing these results in 
animals that expressed ChR2 in AIY alone and the fluorescent 
protein monomeric Kusabira-Orange (mKO) in neurons AIY, AIZ 
and RME (under the promoter ser-2prom2, Supplementary Fig. 2c-f). 
Asymmetric stimulation of AIY in animals expressing ChR2 in AIY, 
AIZ and RME (ser-2prom2) showed the same results (Fig. 2c-f, 
Supplementary Fig. 2g and Supplementary Movie 3). Consistently, 
inhibiting the activity in AIY asymmetrically (ttx-3::Arch) caused the 
animal to turn in the direction opposite to that in which the head was bent 
when AIY were inhibited (Fig. 2f, Supplementary Fig. 3a and 
Supplementary Movie 4). Symmetric excitation and inhibition of AIY 
decreased and increased the reversal frequency respectively, but did 
not cause turning (Fig. 2g, Supplementary Fig. 3 and Supplementary 
Movie 5). 

Both asymmetric and symmetric excitation or inhibition of AIB in 
AIB::ChR2 and AIB::Arch animals (in which ChR2 and Arch are 
expressed under the npr-9 promoter) affected the reversal frequency 


of the animal but did not produce any gradual turning (Fig. 2f, g and 
Supplementary Fig. 4). We could thus control the two locomotory 
behaviours crucial for chemotaxis—gradual turns and reversal fre- 
quency—by driving different patterns of activity in AIY alone. 

Turning is initiated by a larger head-bending angle in one 
direction. When we forced the animal to turn by asymmetrically 
stimulating AIY (ttx-3::ChR2), the head-bending angle in the direction 
of the turn increased (Fig. 3a, b), indicating that asymmetric activation 
of AIY controlled head bending through head motor neurons to cause 
gradual turning. AIY neurons are most directly connected to the head 
muscles through the interneuron pair AIZ, which synapses onto the 
head motor neurons SMB and RME. AIZ neurons have been proposed, 
on the basis of laser ablation, to have a role in gradual turns. Ablations of 
SMB and RME change the head-bending angle during crawling and 
cause loopy behaviour. 

When we specifically excited AIZ::ChR2 (ser-2prom2::ChR2), 
SMB::ChR2 (odr-2(18)::ChR2) or RME::ChR2 (ser-2prom2::ChR2) 
asymmetrically using our set-up, the animals turned (Fig. 3c-e, 
Supplementary Fig. 5 and Supplementary Movies 6-8). These results, 
in conjunction with those from asymmetric optical stimulation of the 
isoamyl-alcohol-sensing neuron AWCON (Supplementary Fig. 6 and 
Supplementary Movie 9), show that asymmetric stimulation of any neuron in the 
sequence of anatomically connected neurons from AWCON, through 
the interneurons AIY and AIZ to the head motor neurons SMB and 
RME (Fig. 3f), causes turning. These sets of neurons thus sense and 
respond to the component of the sensory signal that oscillates asym- 
metrically and in synchrony with head movement to control head 
bending and turning. 

As different patterns of activity in AIY are sufficient to control both 
the frequency of reversals and turning, we tested whether controlling 
AIY activity alone was sufficient to coordinate reversal frequency and 
turning to evoke chemotactic behaviour. To do so, we measured the 
locomotory behaviour of animals in a fixed spatial light gradient that 
directly excited AIY::ChR2 (Supplementary Fig. 7a and Supplementary 
Methods). The animals were unable to track the gradient direction 
(Supplementary Fig. 7c and Supplementary Movie 10). 

Figure 4 | Controlling AIY activity is sufficient to evoke chemotactic 
behaviour. a, Virtual light gradient algorithm (see Methods). At each time, t, 
AIY::ChR2 animals (ttx-3::ChR2) are stimulated with 480 nm blue light with an 
intensity (I(n_x,n_y)) of the virtual gradient at the nose-tip position (n_x,n_y). 
b, Trajectories of AIY::ChR2 animals moving in a virtual light gradient (as in 
a) with a gradient direction at 45°. Black arrowheads denote the mean direction 
of the trajectory; grey bar denotes s.d. (n = 10). c, Top, a sample trajectory of an 
animal in b. Bottom, snapshots of the animal making a gradual turn to reorient 

The somas and processes of AIY are 150 ± 25 µm behind the nose 
tip of an adult animal (Supplementary Fig. 7b). As the speed of the 
animal is 150 ± 50 µm s⁻¹, AIY neurons follow the position of the 
nose tip with approximately a 1-second delay. We proposed that the 
dynamics of AIY excitation caused by the movement of the animal 
through the gradient were not in synchrony with head bending due to 
this delay, preventing the animals from tracking the gradient. 
Therefore, we designed a virtual light gradient in which the excitation 
light intensity on AIY depended not on the positions of AIY in space 
but on the position of the nose tip (Fig. 4a, Supplementary Fig. 7a and 
Supplementary Methods). 
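
The 1-second estimate above follows from dividing the soma offset by the crawling speed. A minimal Python check is sketched below; treating the quoted ± values as simple lower and upper bounds is an assumption made here for illustration, and the function name is hypothetical.

```python
def lag_seconds(offset_um, speed_um_per_s):
    """Time for a point `offset_um` behind the nose tip to reach the nose tip's position."""
    return offset_um / speed_um_per_s

# Central values quoted in the text: 150 um offset, 150 um/s speed.
nominal = lag_seconds(150.0, 150.0)
# Illustrative bounds: smallest offset with fastest speed, and the reverse.
shortest = lag_seconds(150.0 - 25.0, 150.0 + 50.0)
longest = lag_seconds(150.0 + 25.0, 150.0 - 50.0)

print(f"nominal {nominal:.2f} s, range {shortest:.2f} to {longest:.2f} s")
```

Under these assumptions the lag stays of the order of one second across the quoted spread, which is comparable to the timescale of a head swing.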

In this set-up, animals changed their locomotory direction using 
reversals and gradual turns to track the gradient direction stably (the 
fraction of the animals moving up the gradient and hence to the correct 
quadrant (defined as chemotaxis index) = 0.94, Fig. 4b). As with the 
odour profile at the nose tip (Fig. 1d), the temporal light-intensity 
pattern that excited AIY could be written as a sum of an asymmetric 
(A_I(h,t)) and a symmetric (S_I(h,t)) component. When the locomotory 
direction of the animal was not along the direction of the light gradient, 
the magnitude of A_I(h,t) over each head swing, |A| = √⟨A_I²(h,t)⟩, was 
larger, exciting AIY asymmetrically to make the animal turn. As the 
animal oriented itself along the direction of the gradient, the mag- 
nitude of A_I(h,t) continuously diminished, suppressing turns, whereas 
the magnitude of S_I(h,t), |S|, increased and suppressed reversals 
(Fig. 4c, d). Thus, manipulating the dynamics of activity in just the 
AIY interneuron pair is sufficient to evoke chemotactic behaviour. 
This is because the head bending and locomotion of the animal 
through the virtual light gradient together generate and modulate 
the levels of symmetric and asymmetric excitation of AIY which, in 
turn, control future locomotory behaviour (Fig. 4e). 
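
The split into symmetric and asymmetric components can be illustrated with a toy calculation. The sketch below is not the authors' analysis: it assumes a linear gradient, a sinusoidal head swing, and it takes the asymmetric part to be the odd part of the intensity with respect to head side (+h versus -h) and the symmetric part to be the even part. All function names and parameter values are hypothetical.

```python
import math

def intensity(forward, lateral, heading_deg, slope=1.0):
    """Light intensity in a linear gradient (assumed shape).

    heading_deg is the angle between the crawling direction and the
    gradient direction; `forward` and `lateral` are nose-tip coordinates
    along and across the crawling direction.
    """
    phi = math.radians(heading_deg)
    return slope * (forward * math.cos(phi) + lateral * math.sin(phi))

def split_even_odd(i_left, i_right):
    """Return (symmetric, asymmetric) parts with respect to head side."""
    return 0.5 * (i_left + i_right), 0.5 * (i_left - i_right)

def asymmetric_rms(heading_deg, head_amp=0.1, samples=200):
    """Root-mean-square of the asymmetric part over one head swing."""
    total = 0.0
    for k in range(samples):
        h = head_amp * math.sin(2.0 * math.pi * k / samples)
        _, odd = split_even_odd(intensity(0.0, h, heading_deg),
                                intensity(0.0, -h, heading_deg))
        total += odd * odd
    return math.sqrt(total / samples)

for angle in (0, 45, 90):
    print(angle, asymmetric_rms(angle))
```

In this idealized setting the asymmetric magnitude vanishes when the animal heads straight up the gradient and is largest when it crawls across it, which matches the qualitative behaviour described in the text.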

This model would predict robust chemotactic behaviour in the light 
gradient stimulating AIY, with the symmetric and asymmetric com- 
ponents modulating their relative magnitudes to guide the animal 
stably in the correct direction. To test for robust tracking, we suddenly 
rotated the virtual light gradient direction by different angles and 
measured the response of the animal. The animals followed the gra- 
dient direction as this direction was suddenly and repeatedly rotated 
by 180° (Supplementary Movie 11), 120° or 90° (Fig. 4f). 

Previous studies have identified neurons involved in chemotaxis by 
showing that defects in these neurons compromise locomotory pro- 
grams and sensory modalities necessary for this behaviour. Through 
our approach we can identify key neurons in the neural network, the 
dynamics of which are sufficient to drive chemotactic behaviour and 
hence act as control nodes in the network. Our study leads to questions 
of how activity patterns stimulated in the AIY neurons during chemo- 
taxis are disentangled by the downstream neurons to drive the different 
motor programs. Because many chemosensory and thermosensory 
neurons synapse onto AIY interneurons, it is likely that the modulation 
of symmetric and asymmetric activity patterns in these interneurons 
has a central role in the different taxis behaviours of C. elegans. Our 
techniques provide avenues to identify and generate neural activity 
patterns to control all of the behaviours of this nematode. 


METHODS SUMMARY 

Transgenic lines were constructed in the pha-1 (ref. 22) and lite-1 (ref. 23) back- 
ground and maintained using standard molecular biology techniques (see 
Methods). The details of the data analysis to quantify chemotactic behaviour, 
reversal frequency and the set-up for odour stimulation, single neuron stimulation 
and virtual light gradients are explained in Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS 


Strains were grown and maintained under standard conditions unless indicated 
otherwise. Transgenic lines with a pha-1 selection marker were grown at 24 °C 
(ref. 22). All optical stimulation experiments were done in lite-1 mutants to 
minimize the sensitivity of the animal to blue light. A complete strain list and 
information on transgenes are included in Supplementary Table 2. 

Chemotaxis analysis. Animal chemotaxis towards a bacterial lawn was assayed on 
an open-lid, 10-cm nematode growth medium (NGM) plate incubated at room 
temperature (22 °C) overnight with 10 µl Escherichia coli strain OP50 at the centre. 
N2 strain young adults were placed on the plate 1.5 cm away from the centre of the 
bacterial lawn. Animal behaviour was recorded under ×6 magnification by an 
EMCCD camera at 20 Hz and analysed by customized LabView scripts. 

Odour-stimulation set-up. An N2 young adult was placed on an open-lid, food- 
free 10-cm NGM plate for at least 1 min. An electric valve (Supplementary Fig. 1b) 
was used to determine whether the air (100 standard cubic centimetres per minute 
(s.c.c.m.)) was bubbled through water or 1 × 10 * M isoamyl alcohol on the basis 
of the posture of the freely moving animal. The images were recorded under ×6 
magnification by an EMCCD camera at 20 Hz and analysed by customized 
LabView scripts. 

Single-neuron stimulation set-up. L4-stage animals were transferred to a new 
NGM plate with a thin layer of E. coli OP50, containing 100 µM ATR if required, 
24 h before experiments at room temperature. An animal was then placed on an 
open-lid, food-free 6-cm NGM plate for at least 1 min. Dark-field 660 nm illu- 
mination was used to visualize the posture of the animal under ×1 magnification 
by a video camera at 20 Hz (Fig. 2a, computer 2). Low-power 540 nm 
(0.1 mW mm⁻²) epifluorescent illumination was used to visualize the neurons 
co-expressing light-gated ion channels and mKO at ×15 by an EMCCD camera 
at 40 Hz (computer 1). Image thresholding and particle detection were used as the 
image-processing algorithms to identify the mKO-tagged neurons. As the animal 
swung its head, the neurons in the field rotated and changed their positions. To 
offset the rotation effect, positions of the neurons were measured using the 
principal axis and the distance from the centre of mass of the processed image. 
The processed images were then used to track the animal and to position the DLP 
mirrors to deliver light (4 mW mm⁻², 480 or 540 nm) on the neurons of interest 
with any desired temporal patterns to excite or inhibit the activity of the neurons. 
Feedback between the motorized stage, DLP mirrors and image-processing soft- 
ware was operated at 40 frames s⁻¹ to achieve a 5-µm spatial resolution of excita- 
tion on a freely moving animal. The images were processed, recorded and analysed 
by customized LabView scripts. 
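
The rotation-offset step described above (principal axis plus distance from the centre of mass) can be sketched with second-order image moments. The authors used LabView; the Python below is only an illustration of the general technique, with hypothetical function names and with thresholded pixels represented as a list of (x, y) points.

```python
import math

def centroid_and_axis(points):
    """Centre of mass and principal-axis angle (radians) of a point set.

    The angle comes from the central second moments; it is defined only
    up to a 180-degree flip, which a real tracker would have to resolve.
    """
    n = len(points)
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n
    mu20 = sum((x - cx) ** 2 for x, _ in points) / n
    mu02 = sum((y - cy) ** 2 for _, y in points) / n
    mu11 = sum((x - cx) * (y - cy) for x, y in points) / n
    theta = 0.5 * math.atan2(2.0 * mu11, mu20 - mu02)
    return (cx, cy), theta

def to_body_frame(point, centre, theta):
    """Express a point as (along-axis, across-axis) distances from the centre."""
    dx, dy = point[0] - centre[0], point[1] - centre[1]
    along = dx * math.cos(theta) + dy * math.sin(theta)
    across = -dx * math.sin(theta) + dy * math.cos(theta)
    return along, across

pixels = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0), (3.0, 3.0)]
centre, theta = centroid_and_axis(pixels)
print(centre, math.degrees(theta), to_body_frame((3.0, 3.0), centre, theta))
```

Coordinates expressed in this body frame do not change when the whole image rotates, which is the property the set-up relies on when the head swings.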

Reversal frequency. L4-stage animals were transferred to a new NGM plate with a 
thin layer of E. coli OP50, containing 100 µM ATR if required, 24 h before experi- 
ments at room temperature. A 6-cm copper ring was placed in an open-lid, food- 
free 10-cm NGM plate immediately before the experiments to keep the animals in 
the field of view. Young adults were then transferred to the assay plate for 1 min 
before the experiment started. The desired wavelength of light (1 mW mm⁻², 480 
or 540 nm) was delivered in alternate 3-min intervals for 1 h using customized 
LabView scripts. The experiments were recorded by a video camera at 20 Hz. 
Reversal frequency was calculated from pirouettes determined by an automated 
worm tracker (http://wormsense.stanford.edu/tracker). 
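
The alternating illumination protocol can be written out as a simple schedule. The sketch below is illustrative only: the text does not say whether the first 3-min block is lit or dark, so `first_block_on` is an assumed parameter, and the function name is hypothetical.

```python
def light_schedule(total_min=60, block_min=3, first_block_on=False):
    """Return a list of (start_min, end_min, light_on) alternating blocks."""
    blocks = []
    on = first_block_on
    for start in range(0, total_min, block_min):
        blocks.append((start, start + block_min, on))
        on = not on
    return blocks

schedule = light_schedule()
minutes_on = sum(end - start for start, end, on in schedule if on)
print(len(schedule), "blocks,", minutes_on, "min with light on")
```

With 3-min blocks over 1 h, each animal contributes an equal number of lit and unlit intervals, so reversal frequency can be compared within the same recording.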

Virtual light gradient set-up. An AIY::ChR2 animal was placed on an open-lid, 
food-free 10 cm NGM plate for at least 1 min before starting the experiment. A 
virtual gradient of light, I(r) = exp(−r²/r₀²), from 0 to 1 mW mm⁻² over 1.3 mm, 
where r₀ = 0.8 mm, was defined in an x-y coordinate system tied to the centre of 
mass of the animal, which was always at the centre of the gradient profile 
(0.65 mm, 0.65 mm) (Supplementary Fig. 7a). The virtual light gradient moved 
with the centre of mass of the animal but at a fixed orientation. At each time (t), the 
coordinates of the nose tip were identified to calculate the corresponding intensity 
of light I(n_x,n_y) (Fig. 4a). The animal was then illuminated with blue (480 nm) 
light at intensity I(n_x,n_y), and thus AIY instantaneously experienced the intensity 
of light at the nose tip. 
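
A minimal sketch of the intensity lookup is given below. It assumes the Gaussian-shaped profile with r₀ = 0.8 mm and a 1 mW mm⁻² peak; the passage does not state from which point r is measured, so the reference point `peak_xy` is a hypothetical parameter (defaulting to the origin of the x-y window), and the function name is illustrative.

```python
import math

R0_MM = 0.8   # length scale quoted in the text, in mm
I_MAX = 1.0   # peak intensity, in mW per square mm

def gradient_intensity(nose_xy, peak_xy=(0.0, 0.0)):
    """I(r) = I_MAX * exp(-r^2 / r0^2), with r the distance from nose tip to peak_xy."""
    dx = nose_xy[0] - peak_xy[0]
    dy = nose_xy[1] - peak_xy[1]
    return I_MAX * math.exp(-(dx * dx + dy * dy) / (R0_MM * R0_MM))

# Intensity for a nose tip at the centre of the 1.3 mm window (assumed geometry).
print(gradient_intensity((0.65, 0.65)))
```

Because the intensity is evaluated at the nose-tip coordinates rather than at the AIY somas, the roughly 1-second positional lag between nose tip and AIY does not enter the stimulus.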

Molecular biology. Arch::EGFP was cloned into the Fire Lab vector kit plasmid 
pPD96.52. Chop-2(H134R)::TagRFP was obtained by swapping red fluorescent 
protein TagRFP with yellow fluorescent protein (YFP) in the previously cloned 
chop-2(H134R)::YFP. Arch::TagRFP and Arch::mKO were obtained, respectively, 
by swapping TagRFP or mKO with EGFP in Arch::EGFP. TagRFP and GCaMP-3 
(ref. 27) were codon-optimized for C. elegans (de novo synthesized by 
GenScript), whereas others were optimized for mammalian cells. We amplified 
2 kilobases (kb) str-2 (ref. 28), 1 kb ttx-3 (ref. 29), 3 kb npr-9 (ref. 30), 4.5 kb 
ser-2prom2 (ref. 31) and 2.4 kb odr-2(18) (ref. 32) by PCR from C. elegans genomic 
DNA. All of the promoters were then fused with the desired light-gated ion 
channels by PCR fusion. 

Restoration of auditory evoked responses by human 
ES-cell-derived otic progenitors 

Deafness is a condition with a high prevalence worldwide, produced 
primarily by the loss of the sensory hair cells and their associated 
spiral ganglion neurons (SGNs). Of all the forms of deafness, 
auditory neuropathy is of particular concern. This condition, 
defined primarily by damage to the SGNs with relative preservation 
of the hair cells, accounts for a substantial proportion of 
patients with hearing impairment. Although the loss of hair cells 
can be circumvented partially by a cochlear implant, no routine 
treatment is available for sensory neuron loss, as poor innervation 
limits the prospective performance of an implant. Using stem cells 
to recover the damaged sensory circuitry is a potential therapeutic 
strategy. Here we present a protocol to induce differentiation from 
human embryonic stem cells (hESCs) using signals involved in the 
initial specification of the otic placode. We obtained two types of 
otic progenitors able to differentiate in vitro into hair-cell-like cells 
and auditory neurons that display expected electrophysiological 
properties. Moreover, when transplanted into an auditory neuro- 
pathy model, otic neuroprogenitors engraft, differentiate and sig- 
nificantly improve auditory-evoked response thresholds. These 
results should stimulate further research into the development of 
a cell-based therapy for deafness. 

Hair-cell-like phenotypes and sensory neurons, with different 
degrees of functional maturation, have been obtained from mouse 
stem populations. After transplantation, some cell types have 
shown engraftment but none have shown evidence of functional 
recovery. Although useful for research purposes, these products 
are unsuitable for a therapeutic application and appropriate cell types 
of human origin have remained elusive so far. Neuroprogenitors iso- 
lated from mature human cochleae display limited proliferative and 
differentiating potential, and hESC-derived neural crest cells may differ- 
entiate into sensory neurons by exposure to bone morphogenetic 
protein (BMP) but lack true otic characteristics. Recently, we iso- 
lated a population of bipotent stem cells from the human fetal cochlea 
(human fetal auditory stem cells, hFASCs), with the ability to produce 
hair-cell-like cells and neurons. However, although hFASCs can be 
expanded in vitro for approximately 25 population doublings, they 
eventually undergo replicative senescence. Hence, there is a need for 
a reliable, renewable source of human otic progenitors, with the ability 
to produce both cell types for sensory replacement. 

Fibroblast growth factor (FGF) signalling is necessary and sufficient 
for the induction in vivo of the otic placode, the primordium of the 
hearing organ. As the ligands involved in placode signalling in the 
mouse have been identified as FGF3 and FGF10 (refs 22, 23), we 
proposed that exposure to these factors would trigger otic differenti- 
ation of hESCs. Initial experiments with embryoid bodies confirmed 
FGF3 and FGF10 induction of otic features (Supplementary Fig. 1a). 
We therefore focused on developing a method devoid of this initial cell- 
aggregation step, which is prone to high variability. Undifferentiated 
colonies of hESCs were dissociated for plating as a monolayer on 
laminin-coated flasks (see Supplementary Methods). Under these con- 
ditions, treatment with FGF3 and FGF10 induced the placodal markers 
PAX8 and PAX2, either in the presence of knockout serum replace- 
ment or under defined conditions using Dulbecco’s Modified Eagle’s 
Medium (DMEM) with Ham’s F12 and N2/B27 (DFNB) medium 
(Supplementary Methods and Supplementary Figs 1b and 2). Global 
analysis of gene expression was performed using Affymetrix GeneChip 
arrays and, after normalization (see Supplementary Methods), samples 
were mined in two different ways. We first used the Gene Set 
Enrichment Analysis (GSEA) tool to look for genes that were enriched 
in the entire list of probe sets, without establishing an a priori cut-off of 
differential expression (Supplementary Tables 1 and 2). This analysis 
showed that a set of otic markers was significantly enriched in the FGF- 
treated samples when compared with the undifferentiated hESCs 
(normalized enrichment score (NES), 0.568; family-wise error rate 
(FWER) P = 0.046) or cells grown in DFNB (NES, 0.707; FWER 
P = 0.019) (Supplementary Table 1). A second type of analysis assessed 
genes differentially expressed using predefined criteria for fold-change 
cut-off and statistical significance (see Supplementary Methods). A total 
of 1,424 genes (represented by 2,124 probe sets) was differentially upre- 
gulated in the FGF samples when compared to undifferentiated hESCs, 
whereas 423 genes (505 probe sets) were upregulated in the FGF-treated 
versus the DFNB controls (Supplementary Tables 3 and 4). Conversely, 
2,368 genes (3,231 probe sets) were downregulated in the FGF samples 
versus hESCs, and 482 genes (607 probe sets) were downregulated 
versus DFNB (Supplementary Tables 5 and 6). In a gene ontology 
analysis, the gene ontology terms ‘sensory organ development’ 
(Expression Analysis Systematic Explorer (EASE) P value score in 
FGF versus hESC, P = 3.92 × 10 '°; FGF versus DFNB, P = 0.022); 
‘ear development’ (FGF versus hESC, P = 4.47 × 10 °; FGF versus 
DFNB, P = 0.014) and ‘ear morphogenesis’ (FGF versus hESC, 
P = 3.08 × 10 °; FGF versus DFNB, P = 0.0497) were highly enriched 
in the FGF-treated cells in both comparisons, and ‘mechanoreceptor 
differentiation’ and ‘auditory receptor differentiation’ were highly 
enriched in FGF versus hESC (see Supplementary Tables 7-10). Both 
bioinformatics analyses therefore suggested that the FGF treatment was 
generating a global change of transcription compatible with the induc- 
tion of otic progenitors. 

We also used immunostaining to examine the co-expression of 
PAX8 and SOX2, to define the otic progenitors at a cellular level. 
Otic progenitors grew as colonies after the inductive phase. Initial 
immunolabelling showed a relatively large proportion of double- 
positive cells in the FGF-treated condition (~78%), in contrast to 
the relatively moderate upregulation of otic transcripts detected with 
the arrays. However, a subset of cells expressed very high levels of 
PAX8 and SOX2, and these were assessed with an automated micro- 
scopy platform (InCell Analyzer 1000) that enabled quantification of 
the number of positive cells and their relative intensity (Fig. 1 and 
Supplementary Fig. 3). When a stringent threshold was selected 
(75th intensity percentile per cell line and antibody; see Supplementary 
Methods), 18.3% ± 0.8 (± s.e.m.) of the cells expressed high levels of 
PAX8 and SOX2 (PAX8^hi SOX2^hi) after FGF treatment (against 
0% obtained without the growth factors, P < 0.001), whereas 
18% ± 2 cells were PAX8^hi FOXG1^hi (compared to 4% ± 4 cells for 
the control, P < 0.001). PAX8^hi SOX2^hi FOXG1^hi cells also expressed 
the otic markers PAX2, nestin, SIX1 and GATA3 (Fig. 2h and 
Supplementary Figs 4 and 5a). It is likely that this subset of 
PAX8^hi SOX2^hi FOXG1^hi-expressing cells represents the otic progenitors. 
The reproducibility of the protocol was tested across the hESC lines 
H7, H14 and Shef3, which all gave comparable results (see Fig. 1 and 
Supplementary Fig. 3). FGF3 and FGF10 induced two morphologically 
distinct types of otic colonies (Fig. 2a-h). One cell population showed a 
flat phenotype, with large cytoplasm and formed epithelioid islands 
(Fig. 2a-d), whereas the second was small, with denser chromatin, and 
presented cytoplasmic projections (Fig. 2e, f). Given their morpho- 
logical appearance, we have operationally named them otic epithelial 
progenitors (OEPs) and otic neural progenitors (ONPs), respectively. 
The relative proportion of these progenitors was dependent on the cell 
line, plating density and the degree of cell separation (single cells versus 
cell clusters) (Supplementary Figs 5 and 6 and Supplementary 
Methods). Progenitor colonies were purified using sequential dissoci- 
ation (see Supplementary Methods), yielding moderately homogeneous 
cultures of the desired cell colony type, and were expanded in OSCFM 
(otic stem cell full media; see Supplementary Methods). 

Figure 1 | FGF3 and FGF10 generate otic progenitors. Bar charts showing 
the percentage of PAX8^hi SOX2^hi-expressing (top panel) and PAX8^hi FOXG1^hi- 
expressing (bottom panel) cells at the FGF 75th percentile threshold, obtained 
from the hESC lines H14, H7 and Shef3 (n = 3). Error bars, mean ± s.e.m. 
***P < 0.0001. 
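
The percentile-based gating of double-positive cells can be illustrated as follows. This is a sketch of the general idea only, not the InCell Analyzer workflow: it assumes that the threshold for each marker is the 75th percentile of per-cell intensity in a reference (FGF-treated) population, that a cell counts as double-high when it exceeds both thresholds, and that percentiles are computed by linear interpolation. All names and the toy data are hypothetical.

```python
def percentile(values, q):
    """Linear-interpolation percentile (q from 0 to 100) of a non-empty list."""
    xs = sorted(values)
    pos = (len(xs) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(xs) - 1)
    return xs[lo] + (xs[hi] - xs[lo]) * (pos - lo)

def double_high_fraction(cells, reference, q=75.0):
    """Fraction of `cells` above the q-th percentile of `reference` for both markers.

    Each cell is a (marker_1_intensity, marker_2_intensity) pair.
    """
    t1 = percentile([c[0] for c in reference], q)
    t2 = percentile([c[1] for c in reference], q)
    n_high = sum(1 for a, b in cells if a > t1 and b > t2)
    return n_high / len(cells)

# Toy data: two perfectly co-varying markers in the treated population,
# and a control population with tenfold lower intensities.
treated = [(float(i), float(i)) for i in range(1, 101)]
control = [(i / 10.0, i / 10.0) for i in range(1, 101)]
print(double_high_fraction(treated, treated), double_high_fraction(control, treated))
```

In such a scheme the double-high fraction depends on how strongly the two markers co-vary from cell to cell, so it is reported here only for made-up data.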

The differentiation potential of OEPs and ONPs was tested in 
‘neuralizing’ and ‘hair-cell’ culture conditions developed previously 
using hFASCs (see Supplementary Methods). OEPs produced hair- 
cell-like cells as defined by the simultaneous expression of ATOH1 and 
BRN3C, or BRN3C and MYO7A (~45%) (Supplementary Fig. 7). A 
small subset differentiated a rudimentary apical bundle, expressing 
espin (Supplementary Fig. 8). These hair-cell-like cells also expressed 
an outward K⁺ current, the inward rectifier K⁺ current I_K1, and an 
inward Ca²⁺ current (I_Ca) (Supplementary Fig. 9). Under neuralizing 
conditions, they produced a small proportion (~9%) of sensory 
neurons (Supplementary Fig. 7). Conversely, ONPs were committed 
to produce neurons. Under neuralizing conditions, almost all cells 
developed a bipolar morphology and were positive for BRN3A and 
β-tubulin III, as well as for β-tubulin III and NF200. They also 
expressed NEUROD1, ISL1 and NTRK2, a delayed-rectifier K⁺ current 
(I_K), an Na⁺ current (I_Na), and elicited single action potentials 
(Supplementary Fig. 9). No hair-cell differentiation was obtained from 
ONPs under neuralizing or hair-cell culture conditions. Detailed 
results are given in Supplementary Information. 

Figure 2 | Otic epithelial progenitors and otic neuroprogenitors. 
a, b, Morphology of an OEP colony. Scale bars, 100 µm. c, d, The partial lifting 
of OEPs when treated with a short, mild trypsin incubation. Scale bars, 50 µm. 
e, f, Typical morphology of ONPs, showing cytoplasmic projections. Scale bars, 
50 µm. g, Side-by-side ONP and OEP colonies, double-labelled for PAX8 and 
SOX2. Scale bars, 50 µm. h, ONP colony labelled for PAX8 and nestin. Scale 
bars, 50 µm. 

The properties of ONPs in vivo were studied by transplanting them 
into ouabain-treated gerbils, a model of neuropathic deafness. 
Application of ouabain directly to the round window selectively 
damages the type I SGNs, preserving the hair cells and the organ of 
Corti (Supplementary Fig. 10). After ouabain application, only a 
small number of SGNs survived (6.4%; see Supplementary Table 11). 
Most of the surviving cells (~87%) were peripherin-positive type II 
neurons, therefore less than 1% of the original population of type I 
neurons remained (Supplementary Table 11 and Supplementary Fig. 13). 
Staining for myosin VIIa and the presence of distortion product 
otoacoustic emissions (DPOAEs) confirmed that the organ of Corti 
had not been damaged (Supplementary Figs 10 and 11). DPOAEs are 
sounds produced as a consequence of electromechanical feedback 
from the outer hair cells and can be used to check their physiological 
integrity. 
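
The "less than 1%" figure for surviving type I neurons can be checked with a short calculation. The survival and type II percentages come from the text; the share of type I neurons in an intact ganglion is not given in this passage, so the 95% value below is an assumption used only for illustration.

```python
surviving_fraction = 0.064        # SGNs remaining after ouabain (from the text)
type2_share_of_survivors = 0.87   # peripherin-positive survivors (from the text)
type1_share_before = 0.95         # ASSUMPTION: share of type I neurons before ouabain

# Type I survivors as a fraction of all SGNs originally present.
type1_survivors = surviving_fraction * (1.0 - type2_share_of_survivors)
# The same survivors as a fraction of the original type I population.
type1_remaining = type1_survivors / type1_share_before

print(f"{type1_survivors:.2%} of all SGNs; {type1_remaining:.2%} of original type I")
```

The conclusion does not hinge on the assumed 95%: the surviving type I cells are already below 1% of the total original SGN count, so any plausible type I share gives a figure under 1%.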

ONPs derived from Shef1 hESCs constitutively expressing either 
enhanced green fluorescent protein (eGFP) or tomato fluorescent 
protein were expanded in OSCFM, dissociated with trypsin and 
delivered directly into the modiolus, approaching the cochlea through 
the round window. One set of animals was transplanted 3 to 5 days 
after ouabain application (n = 13), and another was transplanted 
2 weeks after the ototoxic drug (n = 5). As no functional or histological 
differences were encountered between the two groups (P> 0.05; 
Supplementary Fig. 12), they were analysed together. Two to three 
weeks after transplantation, five out of six animals had surviving, 
transplanted cells grafted in the modiolus, forming an ectopic spiral 
ganglion (Fig. 3a, b). Cells in the marginal sides of the ectopic ganglion 
had undergone differentiation as judged by β-tubulin III staining 
(Fig. 3b) and displayed neural projections, targeting the organ of 
Corti (Fig. 3c, d). Animals were then monitored for 10 weeks post 
transplantation. Histological analysis 10 weeks post transplanta- 
tion showed that the ectopic ganglion was still present and that cells 
had also migrated into Rosenthal’s canal (Fig. 3e). Transplanted 
cells expressed the 3A10 neurofilament-associated antigen and 
NKAα3 (ATP1A3), a marker of type I neurons and afferent fibres in 
the inner ear (Supplementary Fig. 14). Notably, projections from the 
transplanted cells that reached the organ of Corti were targeting the 
hair cells, and fibres positive for NKAα3 and GluA2 were found next to 
the basal pole of the inner hair cells, suggesting the presence of synaptic 
connections (Fig. 3g). Moreover, fibres from the transplanted cells 
were visualized leaving the modiolus towards the brainstem (Fig. 3f). 
In the cochlear nucleus of three gerbils we found red fluorescent 
protein (RFP)-positive fibres also stained for synaptophysin, suggest- 
ing synaptic connections with the central auditory path (Fig. 3h, i). 
Transplanted ONPs contributed significantly to restore neuronal 
density (Fig. 3j; P < 0.01). Although 112.5 ± 11.9 TUJ1⁺ cells per mm² 
(± s.e.m.) were present in the ouabain-treated, untransplanted ears, 
546.4 ± 30.6 TUJ1⁺ cells per mm² were found after transplantation. 
Of these, 94.9 ± 0.3% were also GFP- (or tomato)-positive, 
confirming their exogenous nature (Supplementary Table 12). The 
number of projections detected in the brainstem was considerably 
lower than the number of transplanted cell bodies identified in the 
ganglion. Although this could be explained by the limited sorting of 
fluorescent protein into the long afferent fibres, the pathfinding of 
the central innervations requires further exploration. No 
tumours were detected in any of the transplanted animals at any stage 
throughout the experiment. 
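
To put the density figures in proportion, the short calculation below uses the mean values quoted in the text and in the Figure 3 legend (untreated right ear). It ignores the reported standard errors and is meant only as a back-of-the-envelope sketch; the variable names are illustrative.

```python
sham_density = 112.5          # ouabain-treated, untransplanted ears (mean)
transplanted_density = 546.4  # ouabain-treated, transplanted ears (mean)
untreated_density = 1743.0    # untreated right ears (mean, Figure 3 legend)
exogenous_share = 0.949       # fraction of cells also GFP- or tomato-positive

fold_increase = transplanted_density / sham_density
fraction_of_untreated = transplanted_density / untreated_density
exogenous_density = transplanted_density * exogenous_share

print(f"{fold_increase:.2f}-fold increase, "
      f"{fraction_of_untreated:.1%} of untreated density, "
      f"{exogenous_density:.1f} exogenous cells per unit area")
```

On these means, transplantation raises neuronal density several-fold over sham-operated ears while remaining well below the density of an untreated cochlea.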

Functional performance was determined by measuring auditory- 
evoked response (ABR) thresholds. These were established based on 
the wave ii-wave iii (P2-N3 waveform) amplitude. These waves are 
generated by the cochlear nucleus and the superior olivary complex 
cells, and reflect neural connections with the central auditory pathway. 
After ouabain application, auditory function was severely impaired, 
with thresholds rising from 20 dB sound pressure level (SPL) to almost 
80 dB SPL, the maximum intensity tested. Frequency discrimination 
was also abolished. The amplitudes of wave ii-wave iii complexes were 
almost negligible at any of the frequencies explored at the maximum 
intensity of 80 dB SPL (Fig. 4d). ABRs were recorded at 1- to 2-week 
intervals. Control animals (n = 8) showed no sign of functional recovery 
throughout the experiment, with a mean auditory threshold after 
10 weeks of 75.14 ± 2.3 dB (± s.e.m.), similar to the 76.37 ± 1.8 dB 
obtained after ouabain treatment. However, in the transplanted animals 
(n = 18), there was a detectable improvement in the ABR thresholds 
(Fig. 4a, b) starting approximately 4 weeks post transplantation, with the 


Figure 3 | Transplantation of otic progenitors restores a population of 
spiral ganglion neurons. a, Mid-modiolar section of a transplanted cochlea 
showing the location of the newly formed, ectopic ganglion. b, Detail of the 
ganglion showing neuronal differentiation by TUJ1 staining (left panel), with 
eGFP (middle panel) and an overlaid image (right panel). c, d, Neural fibres 
project from the ganglion towards the organ of Corti (arrows), passing through 
the Rosenthal’s canal (asterisk). e, New neuronal bodies (arrows) are also found 
in the Rosenthal’s canal (asterisk). f, Ectopic ganglion at the base of the 
modiolus, projecting TUJ1” fibres centrally towards the internal auditory 
meatus. Inset, schematic of the cochlea showing the location of the ectopic 
ganglion. g, RFP” fibres (left panel; arrowheads) approaching the inner hair 
cells and expressing GluA2 (middle-left panel), primarily concentrated in 
postsynaptic densities (PSDs) around the basal pole of inner hair cells (IHCs) 
(middle-right panel; arrow). Dotted lines show the positions of the IHCs. Fibres 
(including PSDs) were also positive for NKA«3 (also known as ATP1A3; right 
panel), a marker of afferent terminals. Nine out of ten animals analysed had 
fibres contacting the IHC, whereas the three animals labelled for GluA2 were 
positive for this marker. h, i, RFP* fibres in the cochlear nucleus, expressing 
synaptophysin (arrows). The fibre branches and surrounds the cell (h), with 
morphology highly reminiscent of the maturing endbulb of Held. Left, RFP; 
middle, synapthophysin; right, overlaid image. j, SGN density 10 weeks after 
transplantation. Conditions compared are cochleae treated with ouabain and 
sham-operated (control left ear (sham)) versus those treated with ouabain and 
transplanted with ONPs (transplanted left ear). Density was significantly 
increased (P < 0.01) from 112.5 + 11.9 (control left ear (sham); n = 3; 

mean + s.e.m.) to 546.4 + 30.6 (transplanted left ear; n = 8). Asa reference, the 
density of control, right ear untreated cochleae, was 1,743 + 71.5 TUJ1* cells 
per mm”, **P <0.01. Scale bars for a—f, 100 jum; for g-i, 50 tum. 
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Figure 4 | Transplanted cells provide a recovery of ABR thresholds. 

a, Evolution of the mean ABR thresholds (click) obtained in the transplanted 
animals (n = 18; mean + s.e.m.) compared to the controls (m = 8). b, Trace 
ABR showing the abolition of waves after ouabain (AO) treatment and the 
restoration of the complexes 10 weeks post transplanation (PT). c, Graph 
showing the mean auditory-threshold shift reduction obtained by the 
transplantation (transplanted 28.6 + 3.6 dB; n = 18 versus 53 + 1.7 dB; n =8 
in the control, P = 0.0002; mean + s.e.m). d, Comparison of the wave ii-wave 
iii amplitudes obtained by tone ABRs. A general trend of enhanced amplitudes 
was obtained across all frequencies tested, being significantly (*P < 0.05) 


mean auditory threshold lowered (improved) to 50.4+4.5dB by 
10 weeks post transplantation. Furthermore, the mean auditory 
threshold shift, calculated as the difference between the threshold at 
10 weeks post transplantation versus the one before ouabain treatment, 
was of 53 + 1.7 dB in the control animals, compared to 28.6 + 3.6 dB in 
the transplanted cohort (P = 0.0002; Fig. 4c). This represents an overall 
functional improvement of approximately 46%. The range of recovery 
went from modest to almost complete (see Supplementary Fig. 15), 
which is remarkable considering the technical challenges involved in 
the procedure. Tonotopical processing was also partially restored 
(Fig. 4d). A trend in the increment of wave ii-wave iii amplitudes was 
detected at each frequency explored, with amplitudes being significantly 
different at 22, 26 and 30 kHz, when compared to the untransplanted 
animals (P<0.05). When compared to the amplitudes before the 
ouabain application, the improvement was approximately 43%. 
Latencies were mostly similar to the ones before ouabain (Fig. 4e). 


different from the untransplanted controls at 22, 26 and 30 kHz. Amplitudes
before ouabain were equivalent between the transplanted (n = 6) and
untransplanted animals (n = 5; mean ± s.e.m.). e, Latencies of wave ii–wave iii
complexes were, in general, comparable before ouabain and after
transplantation. Only at 30 kHz was a significant delay observed (BO:
4.58 ± 0.2 ms, n = 6; PT: 5.9 ± 0.4 ms, n = 5; P < 0.05; mean ± s.e.m.). f, A
significant correlation was observed between the mean density of TUJ1-GFP-
positive cells and the ABR thresholds (n = 8; P < 0.05).

*P < 0.05; ***P < 0.001. 


The only significant difference was detected at 30 kHz (before ouabain,
4.58 ± 0.2 ms, n = 6; post transplantation, 5.9 ± 0.4 ms, n = 5;
P < 0.05), suggesting that some maturation was still taking place at
this stage. Finally, there was a significant correlation between the
increment of neural density by transplanted cells and the lowering of
the ABR threshold (R² = 0.3867, P < 0.05; Fig. 4f).
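
As a worked check of the reported improvement, the sketch below expresses the reduction in mean threshold shift as a fraction of the control shift. The formula is an assumption about how the figure of approximately 46% was derived, since the text does not state it; the function name is illustrative only.

```python
def relative_improvement(control_shift_db, treated_shift_db):
    """Fractional reduction of the threshold shift relative to the controls."""
    if control_shift_db <= 0:
        raise ValueError("control shift must be positive")
    return (control_shift_db - treated_shift_db) / control_shift_db


# Mean threshold shifts quoted in the text (dB).
control_shift = 53.0
transplanted_shift = 28.6

improvement = relative_improvement(control_shift, transplanted_shift)
print(f"Relative improvement: {improvement:.1%}")
```

Under this assumption the two means quoted in the text reproduce a value close to the stated 46%.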

Our developmentally informed protocol produced hESC-derived 
auditory hair cells and neurons that closely resembled phenotypes 
obtained from hFASCs, providing validation of their cochlear 
characteristics. This was supported further by the restoration of ABR 
thresholds on transplantation of otic progenitors into a deaf adult 
mammal. The ability to reinstate auditory neuron functionality paves 
the way for a future cell-based treatment for auditory neuropathies. It 
may also, in combination with a cochlear implant, offer a therapeutic 
solution to a wider range of patients who currently remain without viable
treatment. 
METHODS SUMMARY


The hESC lines used (H7, H14, Shef1, Shef3, Shef1-eGFP and Shef1-tomato), all with a
normal karyotype, were maintained on mouse embryonic fibroblast (MEF) feeders
under standard conditions. Although experiments with embryoid bodies
and initial monolayer experiments were carried out in the presence of KOSR, we
later adopted a chemically defined medium. This serum-free, chemically defined
basal culture medium (DFNB) consisted of a 1:1 mixture of Dulbecco’s Modified Eagle’s Medium
(DMEM):Ham’s F12 with N2 and B27 supplements. In most experiments, FGF3 and
FGF10 were used at 50 ng ml⁻¹. Laminin (R&D Systems) was used at 5 µg cm⁻².
Antibodies, polymerase chain reaction (PCR) primers and microarray analysis are
detailed in the Supplementary Methods. To induce hair-cell differentiation,
progenitors were transferred to gelatin-coated dishes and cultured with DFNB
supplemented with all-trans retinoic acid (10⁻⁶ M; Sigma) and epidermal growth
factor (EGF) (20 ng ml⁻¹) for 2 to 4 weeks. To induce neuronal differentiation, cells
dissociated with trypsin were plated on gelatin-coated dishes and incubated in
DFNB with basic FGF (20 ng ml⁻¹) and Sonic hedgehog (Shh-C24II, 500 ng ml⁻¹;
R&D Systems). On the third day, the culture was supplemented with neurotrophin 3
(NTF3, 10 ng ml⁻¹; Peprotech) and brain-derived neurotrophic factor (BDNF,
10 ng ml⁻¹; Peprotech). Shh-C24II was removed on the fourth or fifth day, whereas
the neurotrophins remained for the length of the incubation, normally between 7
and 14 days. Conditions for electrophysiological recordings are detailed in the
Supplementary Methods. The auditory neuropathy model was generated by
applying 1 mM ouabain directly into the round-window niche of adult gerbils.
Either 3 days or 2 weeks later, hONPs expressing eGFP or tomato fluorescent
protein were injected into the modiolus. Functional recovery was monitored
weekly by measuring ABRs and DPOAEs for up to 10 weeks. Cochleae were taken,
fixed and processed for analysis. Details of the hearing tests and histological
preparation are provided in the Supplementary Methods.


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS

Human ES-cell culture. hESC lines H7, H14, Shef1 (including the derivatives
Shef1-GFP and Shef1-tomato) and Shef3, all with a normal karyotype, were
maintained on inactivated mouse embryonic fibroblast (MEF) feeder cells in
knockout Dulbecco’s modified Eagle’s medium (Invitrogen) supplemented with
20% knockout serum replacement (KOSR), 1% nonessential amino acids, 2 mM
L-glutamine (all from Invitrogen), 0.1 mM β-mercaptoethanol (Sigma) and
4 ng ml⁻¹ of basic fibroblast growth factor (bFGF; R&D Systems).

Differentiation through formation of embryoid bodies. To induce the formation
of embryoid bodies, undifferentiated hESCs were dissociated into small
clumps with collagenase IV (Invitrogen) and transferred into non-adherent
bacterial Petri dishes containing hESC culture medium (minus bFGF) supplemented
with either FGF3 (50 ng ml⁻¹) and FGF10 (250 ng ml⁻¹) or EGF (20 ng ml⁻¹) and
IGF1 (50 ng ml⁻¹). This resulted in the formation of free-floating embryoid
bodies. Cultures were maintained in a humidified chamber in a 5% CO₂ and air
mixture at 37 °C. The embryoid bodies were cultured for 6 days and then allowed
to attach onto tissue culture dishes coated with 0.1% gelatin. After 10 days, the
cultures with EGF and IGF1 were supplemented with 10 ng ml⁻¹ bFGF for a
further 6 to 8 days. Cells treated with FGF3 and FGF10 remained exposed to the growth factors
throughout the experiment. All growth factors and supplements were obtained
from R&D Systems or Invitrogen.

Induction of otic progenitors directly as monolayers. Undifferentiated hESCs
were dissociated with 0.025% trypsin-EDTA (Sigma) and the cell suspension
was passed through either a 70- or 100-µm cell strainer (BD Labware). The 70-µm
strainer gave primarily a single-cell suspension, whereas the 100-µm strainer
retained a few 2-to-3-cell clumps. Cells were plated at different densities onto
laminin-coated plastic (5 µg cm⁻²; R&D Systems). Gelatin-coated dishes were
also used, but the adhesion of cells proved to be very poor. Cells were incubated
with chemically defined DFNB medium (DMEM high glucose:F12 mixed 1:1,
with N2 and B27). This basal medium was supplemented, from the moment of
plating, either with FGF3 (50 ng ml⁻¹) and FGF10 (50 ng ml⁻¹) and allowed to
differentiate for 10 to 12 days, or with EGF and IGF1 plus bFGF as described above
for embryoid body formation. The medium was replaced completely every
2 days. During the first few days post plating, a high level of cell death is customary.
Colony enrichment and induction of differentiation into hair-cell-like cells
and sensory neurons. To enrich for OEPs, cells surrounding the epithelial
colonies were lifted with a brief incubation in 0.025% trypsin-EDTA, helped
by mechanical scraping under a microscope with a pipette tip. Once colony
edges started to curl (see Fig. 2c, d), the cells were rinsed away. A second, prolonged
trypsin step allowed the collection of the epithelial colonies that remained attached.

To induce differentiation into hair-cell-like cells, we followed the protocol
developed with hFASCs. Dissociated cells were transferred to gelatin-coated
dishes and cultured with DFNB supplemented with all-trans retinoic acid
(10⁻⁶ M; Sigma) and EGF (20 ng ml⁻¹) for 2 to 4 weeks.

To enrich for ONPs, surrounding cells were scraped off under the microscope
using a 10-µl pipette tip bent at the end. Small colonies were then allowed to grow
for another 2 to 3 days before being dissociated with trypsin. These
methods gave a fairly homogeneous culture of the desired colony type.

To induce differentiation into auditory sensory neurons, dissociated cells were
plated on gelatin-coated dishes and incubated in DFNB with bFGF (20 ng ml⁻¹)
and Sonic hedgehog (Shh-C24II, 500 ng ml⁻¹; R&D Systems). On the third day, the
culture was supplemented with neurotrophin 3 (NTF3, 10 ng ml⁻¹; Peprotech)
and brain-derived neurotrophic factor (BDNF, 10 ng ml⁻¹; Peprotech). Shh-C24II
was removed on the fourth or fifth day, whereas the neurotrophins remained for
the length of the incubation, normally between 7 and 14 days.

For proliferative expansion, progenitors were cultured in otic stem cell full
media (OSCFM; DFNB plus 20 ng ml⁻¹ bFGF, 50 ng ml⁻¹ IGF1 and 20 ng ml⁻¹
EGF).
RNA isolation and gene expression analysis. Total RNA was isolated using 
Trizol (Invitrogen) and was reverse transcribed into cDNA using Superscript III 
(Invitrogen). PCR was performed using standard protocols with Hotstar Taq 
polymerase (Qiagen). 

Forward and reverse primer sequences, from the 5′ to 3′ direction, were as
follows: PAX2, GAGCGAGTTCTCCGGCAAC and GTCAGACGGGGACGATGTG;
PAX8, ACCCCCAAGGTGGTGGAGAAGA and CTCGAGGTGGTGCTGGCTGAAG;
GAPDH, GTCCACTGGCGTCTTCACCA and GTGGCAGTGATGGCATGGAC;
SOX2, ATGCACCGCTACGACGTGA and CTTTTGCACCCCTCCCATTT;
FGF3, TTGGAGATAACGGCAGTGGA and CTCCAGGTTATCCGGGCTCT;
PDS, AGCAGAGACAGGTCATGGCA and ATCCGACAGGAACTGCAGCT;
harmonin, AGCTGGTCATCAATGAACC and AGATGGAATATCCATTGATCCG;
POU4F3 (also known as BRN3C), TGCAAGAACCCAAATTCTCC and GAGCTCTGGCTTGCTGTTCT;
GATA3, GTACAGCTCCGGACTCTTCCC and CTGCTCTCCTGGCTGCAGACA;
MYO7A, CACATCTTTGCCATTGCTGAC and AGAAGAGAACCTCACAGGCAT;
NEUROD1, GCCCCAGGGTTATGAGACTATCACT and CCGACAGAGCCCAGATGTAGTTCTT;
ATOH1, CTCAGCCCCAGCTTCTGC and AAACAACGACCATCGCAGAG;
HPRT, AATTATGGACAGGACTGAACGTC and CGTGGGGTCCTTTTCACCAGCAAG;
POU4F1 (also known as BRN3A), GGCCCACCTCAAGATCCCGG and AGTTTCTCGGCGATGGCGGC;
NTRK2 (also known as TRKB), GAGCATCATGTACAGGAAAT and CTTGATGTTCTTCCTCATGT;
ISLET1, CAACAAACAAAACGCAAAAC and AAGTCAAACACAATCCCGA.

Relative quantification of expression was performed using SYBR Green and the
following primers: RPLP0, GAAGGCTGTGGTGCTGATGG and CCGGATATGAGGCAGCAGTT;
PAX8, CTTGGCAGGTACTACGAGAC and GCAAACATGGTAGGGTTCTG;
PAX2, CTTTAAGAGATGTGTCTGAGGG and CCTGTTCTGATTTGATGTGCT.
ΔCt values were calculated against the ribosomal protein RPLP0, and ΔΔCt
values were calculated against the levels expressed by undifferentiated
hESCs. Undirected differentiation of embryoid bodies was induced by allowing the
cells to aggregate in the presence of hESC medium (minus bFGF) supplemented with 5%
FCS. Reactions were carried out in triplicate and values represent the mean from 2 to
4 independent experiments. Three different hESC lines were tested.
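
The relative quantification described above can be illustrated with a minimal sketch. It normalizes the target gene to RPLP0 (ΔCt) and then to undifferentiated hESCs (ΔΔCt). Converting ΔΔCt to a fold change as 2^(−ΔΔCt) assumes near-perfect amplification efficiency, which the text does not state, and all Ct values below are hypothetical.

```python
from statistics import mean


def delta_ct(ct_target, ct_reference):
    """Ct of the target gene normalized to the reference gene (RPLP0 here)."""
    return ct_target - ct_reference


def fold_change(target_sample, ref_sample, target_calibrator, ref_calibrator):
    """Fold change of the sample relative to the calibrator (undifferentiated hESCs).

    Each argument is a list of replicate Ct values. Uses 2 ** (-ddCt), which
    assumes ~100% PCR efficiency (an assumption, not stated in the text).
    """
    d_sample = delta_ct(mean(target_sample), mean(ref_sample))
    d_calibrator = delta_ct(mean(target_calibrator), mean(ref_calibrator))
    ddct = d_sample - d_calibrator
    return 2.0 ** (-ddct)


# Hypothetical triplicate Ct values for PAX2 and RPLP0.
pax2_treated = [24.0, 24.0, 24.0]
rplp0_treated = [18.0, 18.0, 18.0]
pax2_hesc = [29.0, 29.0, 29.0]
rplp0_hesc = [18.0, 18.0, 18.0]

fold = fold_change(pax2_treated, rplp0_treated, pax2_hesc, rplp0_hesc)
print(f"PAX2 fold change versus undifferentiated hESCs: {fold:.1f}")
```

With these made-up values ΔΔCt is −5, so the sketch reports a 32-fold increase.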

Affymetrix microarrays. Gene-expression profiles were obtained from hESC
lines H14, Shef1 and Shef3 by hybridizing samples from undifferentiated,
FGF3-and-FGF10-treated and DFNB-treated cells to GeneChip HG-U133 Plus
2.0 arrays. Cells were cultured under differentiating conditions for 14 days before
RNA was isolated. Normalization and initial analysis were done using puma
(http://www.bioconductor.org). The chosen method is particularly robust and
accurate when working with small sample sizes and the potentially high variability 
commonly found in human samples. Merging of the expression values from each 
experimental condition independently obtained from the three different cell lines 
was carried out using a hierarchical mixture of Gaussian distributions. The combined
value for each transcript is the most probable value that represents the
mixture of the two classes based on the uncertainty associated with each transcript
at probe level. Analysis for the specific enrichment of otic markers was done using
the Gene Set Enrichment Analysis (GSEA) tool. This is a powerful algorithm that
determines whether a set of genes is randomly distributed in a ranked list or 
primarily found at the top or bottom by calculating an enrichment score. This 
approach is based on the principle that a particular ‘signature’ of genes expressed 
together is highly informative, even if some individual genes change only by a 
small percentage (for example, 20%). This is because it identifies a structure of 
correlation within the genes rather than isolated outliers. A set of known
pluripotency markers was run in parallel as a reference control. For the second
analysis, probe sets were counted as differentially expressed if their expression
changed by ≥1.5-fold (log₂ ratio of 0.5849) and their probability of positive log ratio
(PPLR statistic) was >0.8 for upregulated or <0.2 for downregulated probe sets.
Pathway and gene ontology (GO) enrichment analyses were carried out on these 
differentially expressed gene lists using the Database for Annotation,
Visualization, and Integrated Discovery (DAVID; http://david.abcc.ncifcrf.gov/home.jsp).
Functional annotation was used to reveal biological processes highly
represented in a probe set list, and the significance of their enrichment
was expressed using the EASE score threshold (P < 0.05).
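
The differential-expression criteria above (a change of at least 1.5-fold combined with a PPLR above 0.8 or below 0.2) can be sketched as a simple filter. This is an illustration only, not the puma implementation; the `ProbeSet` record and the probe identifiers are hypothetical.

```python
import math
from dataclasses import dataclass

FOLD_THRESHOLD = 1.5
LOG2_THRESHOLD = math.log2(FOLD_THRESHOLD)  # about 0.585


@dataclass
class ProbeSet:
    probe_id: str      # hypothetical identifier
    log2_ratio: float  # log2(treated / undifferentiated)
    pplr: float        # probability of positive log ratio


def classify(probe):
    """Label a probe set as 'up', 'down' or 'unchanged' using the stated cut-offs."""
    if probe.log2_ratio >= LOG2_THRESHOLD and probe.pplr > 0.8:
        return "up"
    if probe.log2_ratio <= -LOG2_THRESHOLD and probe.pplr < 0.2:
        return "down"
    return "unchanged"


# Hypothetical probe sets.
probes = [
    ProbeSet("probe_A", 1.20, 0.95),   # large increase, high PPLR
    ProbeSet("probe_B", -0.90, 0.05),  # large decrease, low PPLR
    ProbeSet("probe_C", 0.70, 0.60),   # fold change passes, PPLR does not
    ProbeSet("probe_D", 0.30, 0.99),   # PPLR passes, fold change does not
]

labels = {p.probe_id: classify(p) for p in probes}
print(labels)
```

Requiring both conditions means a probe set is retained only when the change is large enough and the posterior evidence for its direction is strong.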

Immunolabelling. Cells and sections were fixed in cold 4% paraformaldehyde in
PBS for 10 minutes, permeabilized and blocked in 0.1% Triton X-100, 5% normal
donkey serum in PBS for 20 minutes at room temperature (20–25 °C) and then
incubated with the primary antibody in the same buffer. Antibodies used in this
study have been widely used and well characterized. These antibodies were against
SOX2 (1:100, rabbit polyclonal, Millipore); nestin (1:100, mouse monoclonal,
Abcam); PAX8 (1:100, goat polyclonal, Abcam); PAX2 (1:100, rabbit polyclonal,
Abcam); GATA3 (1:50, mouse monoclonal, Santa Cruz); FOXG1 (1:50, rabbit
polyclonal, Abcam); SIX1 (1:100, mouse monoclonal clone 3C7, Sigma);
ATOH1 (1:100, rabbit polyclonal, Abcam); BRN3c (1:100, mouse polyclonal,
Abnova); espin (1:100, rabbit polyclonal, Sigma); β-tubulin III (TUJ1) (1:100,
mouse monoclonal, Covance); NF200 (1:100, rabbit polyclonal, Sigma);
synaptophysin (1:150, mouse monoclonal, Millipore); 3A10 (1:50, mouse
monoclonal, DSHB, Iowa); BRN3a (1:100, rabbit polyclonal, Chemicon); NKAα3
(1:75, goat polyclonal, Santa Cruz) and GluR2 (1:100, mouse monoclonal clone
121/32, Millipore). As GFP and tomato fluorescent protein can become
downregulated after transplantation, their signal was amplified using either anti-GFP
(1:100, rabbit polyclonal, Torrey Pines Biolabs) or anti-RFP (1:100, rabbit
polyclonal, Abcam) antibodies. The myosin VIIA antibody was a gift from C. Petit.
Specific labelling was visualized with secondary donkey anti-mouse, anti-goat or
anti-rabbit antibodies conjugated to either Alexa Fluor 488 or Alexa Fluor 568.
Controls were carried out by replacing the primary antibody with non-specific
mouse or rabbit immunoglobulin-G (IgG). Nuclei were counterstained with
DAPI (4′,6-diamidino-2-phenylindole; Sigma).
Images were acquired using a Zeiss Axiophot microscope with AxioVision
software. For conventional quantification, several hundred cells were scored from
random fields from two to five independent experiments. Statistical comparisons
of means were made using analysis of variance (ANOVA; two-way ANOVA
followed by the Bonferroni post test). For all statistical tests, P < 0.05 was used
as the criterion for statistical significance.
Quantification using the InCell Analyzer. The hESC lines H7, H14 and Shef3
were seeded at 1,500 cells per well of a 96-well clear flat-bottom plate (655090;
Greiner Bio-One) in either FGF or DFNB media. Images of several thousand
stained cells from three independent wells were acquired using an automated
microscopy platform (InCell Analyzer 1000, GE Healthcare). Forty random fields
were acquired in each well using a ×20 objective. Images were analysed using
Developer Toolbox 1.7 software (GE Healthcare).

Nuclei were counted as positive at two different thresholds of fluorescence
intensity; these were set independently for each cell line and plate. The first
threshold was set according to the fluorescence intensity distribution histogram
of the cells in the control (no primary antibody) wells: cells were counted as
positive, green or red, if the nuclear intensity was greater than the 99th intensity
percentile point in channel 2 (green) or 3 (red), respectively. Similarly, a second
threshold was set to count very highly positive cells. This was determined from the
fluorescence intensity distribution of the cells grown in FGF; cells were counted as
highly positive if the nuclear intensity was greater than the 75th intensity percentile
point in this condition. Significance was determined using the chi-square test with Yates’
correction.
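
The thresholding and significance test described above can be sketched as follows. This is not the Developer Toolbox procedure: the percentile interpolation rule, the intensity values and the cell counts are assumptions made for illustration. The chi-square statistic for a 2 × 2 table uses Yates' continuity correction, and the P value for one degree of freedom is obtained from the complementary error function.

```python
import math


def percentile(values, q):
    """q-th percentile (0-100) by linear interpolation between order statistics."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("no values supplied")
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def chi_square_yates(a, b, c, d):
    """Chi-square with Yates' correction for the 2x2 table [[a, b], [c, d]].

    Returns (chi2, p) with one degree of freedom.
    """
    n = a + b + c + d
    margins = (a + b, c + d, a + c, b + d)
    if 0 in margins:
        raise ValueError("every row and column must contain at least one cell")
    corrected = max(0.0, abs(a * d - b * c) - n / 2.0)
    chi2 = n * corrected ** 2 / math.prod(margins)
    p = math.erfc(math.sqrt(chi2 / 2.0))  # survival function for 1 d.f.
    return chi2, p


# Hypothetical nuclear intensities from control (no primary antibody) wells.
control_intensities = [100 + i for i in range(200)]
threshold_99 = percentile(control_intensities, 99)

# Hypothetical counts: positive and negative nuclei in two culture conditions.
chi2, p_value = chi_square_yates(10, 20, 30, 40)
print(f"99th percentile threshold: {threshold_99:.2f}")
print(f"chi2 = {chi2:.4f}, P = {p_value:.3f}")
```

The same `percentile` helper would give the second threshold by applying it with q = 75 to the intensities measured in the FGF condition.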
Electrophysiology. Membrane currents from undifferentiated cells and from differentiating
hair-cell-like cells and sensory neurons were measured using the whole-cell
patch-clamp technique with an Optopatch (Cairn Research) or Axopatch 200B
(Molecular Devices) amplifier. Recordings were carried out at room temperature
(20–25 °C) from cells cultured between 9 and 19 days under either ‘hair cell’ or
‘neuralizing’ conditions, or soon after completing the induction with FGF3 and
FGF10 (undifferentiated). Current-clamp recordings of the voltage responses of
neuron-like cells were performed at body temperature (35–37 °C). The extracellular
solution contained (in mM): 135 NaCl, 5.8 KCl, 1.3 CaCl₂, 0.9 MgCl₂, 0.7 NaH₂PO₄,
5.6 D-glucose, 10 HEPES-NaOH and 2 sodium pyruvate. Amino acids and vitamins
for Eagle’s minimal essential medium (MEM) were added from concentrates
(Fisher). The pH was adjusted to 7.5. Cells were viewed using an upright microscope
(Leica) and continuously superfused with the above extracellular solution. Patch
pipettes were made from soda glass capillaries (3–4 MΩ) and coated with surf wax.
The pipette filling solution contained (in mM): 131 KCl, 3 MgCl₂, 1 EGTA-KOH,
5 Na₂ATP, 5 HEPES-KOH and 10 sodium phosphocreatine (pH 7.3).

Data were acquired using pClamp software and a Digidata analogue-to-digital
converter (Molecular Devices). Data were filtered at 2.5 or 5 kHz, sampled at 5 or
50 kHz and stored on computer for off-line analysis using Clampfit and Origin
(OriginLab) software. Membrane capacitance (Cm) was 30 ± 4 pF (n = 43, range
from 9 to >100 pF) and residual series resistance was 7.8 ± 0.7 MΩ, resulting in
voltage-clamp time constants of 244 ± 40 µs. Membrane currents were elicited by
applying voltage steps in 10-mV nominal increments or decrements from the
holding potential of −84 mV or −104 mV (for IK, ICa and INa) or −64 mV (for IK1).
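
As a rough consistency check on the recording parameters, the voltage-clamp time constant can be approximated as the product of residual series resistance and membrane capacitance. The sketch below applies this to the mean values quoted above; it is an approximation under that assumption, and the product of the means need not equal the reported mean of the per-cell time constants.

```python
def voltage_clamp_tau_us(series_resistance_mohm, capacitance_pf):
    """Time constant tau = Rs * Cm, returned in microseconds.

    MOhm * pF = 1e6 Ohm * 1e-12 F = 1e-6 s, so the product is already in µs.
    """
    return series_resistance_mohm * capacitance_pf


# Mean values quoted in the text.
mean_rs_mohm = 7.8
mean_cm_pf = 30.0

tau_us = voltage_clamp_tau_us(mean_rs_mohm, mean_cm_pf)
print(f"tau from mean Rs and mean Cm: {tau_us:.0f} µs")
```

The result is of the same order as the reported time constant, which is the expected outcome when the time constant is averaged cell by cell rather than computed from the two means.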

Recordings and reported currents were corrected off-line for linear leakage,
typically calculated between −84 mV and −74 mV (undifferentiated cells:
0.4 ± 0.1 nS, n = 6; hair-cell-like cells: 2.3 ± 0.8 nS, n = 14; neuron-like cells:
1.7 ± 0.6 nS, n = 15). Membrane potentials were corrected for the voltage drop
across the residual series resistance (Rs) and for a liquid junction potential of
−4 mV. Statistical comparisons of means were made using ANOVA (one-way
ANOVA followed by the Tukey post-test). For all statistical tests, P < 0.05 was
used as the criterion for statistical significance. Mean values are quoted ± s.e.m. in
text and figures.

Transplantation and cochleae processing. Animal experiments were approved
by the Sheffield University Ethical Review Committee and carried out under a
Home Office Project License conforming to UK legislation. Anaesthesia in young
adult gerbils (3–8 months old) was induced with ketamine and xylazine, and
maintained with isoflurane during the surgical intervention. Before the deafening
procedure, auditory function was measured as described below. Under sterile
conditions and using a retro-auricular approach, the bulla of the left ear was
exposed and a small hole opened on its surface. Twenty microlitres of a 1-mM
ouabain solution (Sigma) were applied to the round-window niche, incubated for
30 min and then absorbed with a small cotton wick. The right cochlea was left
undisturbed and used as a control in each animal. The opening in the bulla was sealed
with a piece of fascia and Vetbond (3M), the surgical wound was closed with sutures and
the animal was allowed to recover for 3 days.

Before cell transplantation, auditory function was measured to ascertain the
efficacy of the deafening protocol. The bulla was exposed as before and the
round-window niche was re-opened. A small hole was drilled into the modiolus by going
through the round-window membrane, using a 30G needle or a KFlex dental file
(no. 25). Human ONPs, expanded in OSCFM, were dissociated and collected at a
density of 1.5 × 10⁴ cells per µl in DMEM. About 3 µl (4–5 × 10⁴ cells) were
injected into the central modiolus using a stainless steel NanoFil syringe (WPI)
with a 33G tungsten bevelled needle. Control animals underwent an identical
intervention, but were transplanted with DMEM only. To prevent rejection,
gerbils were given daily injections of cyclosporine A (15 mg kg⁻¹ day⁻¹; Sandoz).

At selected time points, animals were killed and fixed by transcardiac perfusion 
with 4% paraformaldehyde in PBS. Cochleae were removed, post-fixed overnight
and decalcified by immersion in 0.125 M EDTA for approximately 2 weeks. Tissue 
was embedded in Cryo-M-Bed (Bright) and sectioned in a cryostat. 

Spiral ganglion cell densities. Cells present after ouabain treatment and after
transplantation that expressed the appropriate markers were counted in the
apical, mid and basal turns in 7 to 27 randomly selected mid-modiolus sections
from each cochlea. The area was measured using ImageJ and densities calculated as
cells per mm². Statistical comparisons of means were made using ANOVA. For all
statistical tests, P < 0.05 was used as the criterion for statistical significance. Mean
values are quoted ± s.e.m. in text and figures.

ABRs and DPOAEs. ABR testing was conducted in an isolated laboratory room.
Prior to testing, the gerbils were sedated with ketamine and xylazine and placed on
a heating pad. The ABRs were recorded using System 3 digital signal processing
hardware and software (Tucker Davis Technologies (TDT)). ABR stimuli were
presented using two enhanced real-time processors (RP2.1), a programmable
attenuator (PA-5) to control the stimulus levels and closed-field magnetic speakers
(CF1). The CF1 speaker presented the ABR stimuli through a 10-cm tube that
delivered them directly into the ear canal. The ABR responses were recorded
using 27-gauge subdermal needle electrodes (Rochester Electro-Medical) connected
to a low-impedance headstage (RA4LI) and Medusa preamplifier (RA4PA) before
being sent to a Medusa base station signal processor (RA16). Click stimuli were
presented at a rate of 20 s⁻¹, ranging from 80 dB SPL to 20 dB SPL in 10-dB
decrements. For tone ABRs, 5-ms pure-tone pips (ranging from 6 kHz to
38 kHz in 4-kHz increments) were presented at an intensity of 80 dB SPL. The
ABR waveforms were produced through differential voltage recordings from electrodes
placed at the vertex (recording electrode) and ipsilateral mastoid (reference
electrode), with the ground electrode placed on the lower back. Each ABR waveform
represented the average response to 500 stimulus presentations. The amplitude
between the wave ii positive peak and wave iii negative peak was measured
at each stimulus level, and the ABR threshold was determined as the stimulus level
that evoked a voltage 2 s.d. above the mean background noise level.
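
The threshold criterion above can be illustrated with a short sketch. It descends through the stimulus levels and reports the last level whose wave ii to wave iii amplitude still exceeds the mean noise plus 2 s.d. The descending search, the use of the sample standard deviation and all amplitude values are assumptions made for illustration; the text specifies only the 2 s.d. criterion.

```python
from statistics import mean, stdev


def abr_threshold(amplitude_by_level, noise_samples, n_sd=2.0):
    """Return the lowest stimulus level (dB SPL), searching downwards, whose
    response exceeds mean(noise) + n_sd * s.d.(noise); None if none does."""
    criterion = mean(noise_samples) + n_sd * stdev(noise_samples)
    threshold = None
    for level in sorted(amplitude_by_level, reverse=True):
        if amplitude_by_level[level] <= criterion:
            break  # response lost in the noise; stop descending
        threshold = level
    return threshold


# Hypothetical peak-to-peak amplitudes (µV) for clicks from 80 to 20 dB SPL.
amplitudes = {80: 2.1, 70: 1.6, 60: 1.1, 50: 0.7, 40: 0.3, 30: 0.12, 20: 0.10}
# Hypothetical background noise measurements (µV).
noise = [0.08, 0.12, 0.10, 0.11, 0.09]

threshold_db = abr_threshold(amplitudes, noise)
print(f"Estimated ABR threshold: {threshold_db} dB SPL")
```

With these made-up values the criterion is about 0.13 µV, so the response at 40 dB SPL passes and the one at 30 dB SPL does not.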

DPOAE testing was conducted in the same room, and gerbils were prepared for
the procedure as they were for ABR testing. DPOAEs were recorded using the
same TDT workstation under different settings. The instrumentation used to
present the DPOAE stimuli consisted of a single ear-probe unit that directed the stimuli
into the ear canal, an Etymotic ER-10B+ low-noise microphone system that
provided 20 dB of gain, and a microphone amplifier (MA-3) that provided an
additional 20 dB of gain to the DPOAE responses prior to digital conversion. The
sampling rate used to generate stimuli and digitize the responses was 100 kHz. The
DPOAEs were generated by simultaneously presenting two sinusoids differing in
frequency into the ear canal of the gerbil (the lower frequency was labelled f1 and
the higher frequency was labelled f2). The amplitude of the distortion product at
the frequency defined by 2f1 − f2 was then measured by recording the pressure in
the ear canal. The stimuli were selected for DPOAE testing from 6 kHz to 42 kHz
in 4-kHz increments. The sound levels for the f1 and f2 primaries were calibrated
to 65 dB SPL and 60 dB SPL, respectively, using an ACOustical interface
system (ACO Pacific) with a 2-cc calibration syringe.
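
To make the stimulus arithmetic concrete, the sketch below lists the test frequencies from 6 kHz to 42 kHz in 4-kHz steps and computes the distortion-product frequency 2f1 − f2 for each primary pair. Treating the listed frequencies as f2 and using an f2/f1 ratio of 1.2 are assumptions for illustration; neither is specified in the text.

```python
ASSUMED_F2_F1_RATIO = 1.2  # assumption: not given in the text


def test_frequencies_khz(start=6, stop=42, step=4):
    """Test frequencies in kHz, inclusive of both end points."""
    return list(range(start, stop + 1, step))


def distortion_product_khz(f1, f2):
    """Frequency of the cubic distortion product, 2*f1 - f2."""
    if not f1 < f2:
        raise ValueError("f1 must be the lower primary")
    return 2 * f1 - f2


frequencies = test_frequencies_khz()
for f2 in frequencies:
    f1 = f2 / ASSUMED_F2_F1_RATIO
    dp = distortion_product_khz(f1, f2)
    print(f"f2 = {f2:2d} kHz, f1 = {f1:5.2f} kHz, 2f1 - f2 = {dp:5.2f} kHz")
```

Because the distortion product lies below both primaries, it can be separated from the stimuli in the spectrum of the recorded ear-canal pressure.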

All ABR and DPOAE stimuli were created using TDT SigGen and recordings
conducted using TDT BioSig software. Statistical comparisons of means were
made using either the unpaired Student’s two-tailed t-test for two data sets, or
for comparisons of multiple data sets, using ANOVA (two-way ANOVA followed
by the Bonferroni post test). For all statistical tests, P < 0.05 was used as the
criterion for statistical significance. Mean values are quoted ± s.e.m. in text and
figures.
After host entry through mucosal surfaces, human immunodeficiency
virus-1 (HIV-1) disseminates to lymphoid tissues to
establish a generalized infection of the immune system. The
mechanisms by which this virus spreads among permissive target
cells locally during the early stages of transmission and systemically
during subsequent dissemination are not known. In vitro studies
suggest that the formation of virological synapses during stable
contacts between infected and uninfected T cells greatly increases
the efficiency of viral transfer. It is unclear, however, whether T-cell
contacts are sufficiently stable in vivo to allow for functional
synapse formation under the conditions of perpetual cell motility
in epithelial and lymphoid tissues. Here, using multiphoton
intravital microscopy, we examine the dynamic behaviour of
HIV-infected T cells in the lymph nodes of humanized mice. We
find that most productively infected T cells migrate robustly, resulting
in their even distribution throughout the lymph node cortex. A
subset of infected cells formed multinucleated syncytia through
HIV envelope-dependent cell fusion. Both uncoordinated motility
of syncytia and adhesion to CD4⁺ lymph node cells led to the
formation of long membrane tethers, increasing cell lengths to up
to ten times that of migrating uninfected T cells. Blocking the egress
of migratory T cells from the lymph nodes into efferent lymph
vessels, and thus interrupting T-cell recirculation, limited HIV
dissemination and strongly reduced plasma viraemia. Thus, we
have found that HIV-infected T cells are motile, form syncytia
and establish tethering interactions that may facilitate cell-to-cell
transmission through virological synapses. Migration of T cells in
lymph nodes therefore spreads infection locally, whereas their
recirculation through tissues is important for efficient systemic
viral spread, suggesting new molecular targets to antagonize HIV
infection.

In HIV infection, lymph nodes are important sites of viral replication,
in which the high local density of CD4⁺ T cells and other target
cells may favour cell contact-mediated viral spread. Because infection
of BLT (bone marrow/liver/thymus) humanized mice replicates many
hallmarks of infection in humans, we used this small animal model
(Supplementary Fig. 1a) and intravital microscopy to investigate the
dynamic behaviour of HIV-infected T cells in the stromal environment
of lymph nodes.

We first sought to validate this mouse model for studying the effects
of HIV infection on T-cell migration. The presence of both naive and
memory human CD4⁺ and CD8⁺ T cells in secondary lymphoid
organs of BLT mice demonstrates their general capacity to home to
these sites (Fig. 1a). To determine the efficiency of this trafficking
Figure 1 | Human T-cell migration in lymph nodes of BLT mice. a, Human
CD4⁺ T cells with naive (CD45RA⁺ CCR7⁺), central memory (CD45RA⁻
CCR7⁺) and effector memory (CD45RA⁻ CCR7⁻) phenotypes are
represented in lymphoid organs and peripheral blood of BLT mice. Relative
frequencies in blood are similar to those in humans. Plots are gated on human CD4⁺
CD3⁺ cells. PBL, peripheral blood lymphocytes. b, Homing of human and
mouse CD4⁺ T cells into lymphoid tissues. Data pooled from three
independent experiments. Values are mean and s.e.m. MLN, mesenteric lymph
nodes; pLN, peripheral lymph nodes. c, Multiphoton intravital micrograph of a
BLT popliteal lymph node 24 h after adoptive transfer of human (green) and
mouse (red) T cells. Graph on the right shows migratory tracks of each
population during a 30-min recording. d, e, Mean 2D-track velocities (d) and
arrest coefficients (e) of mouse and human T cells in BLT lymph nodes. Lines
and numbers indicate medians. Data pooled from three recordings, two
independent experiments. n.s., not significant.
process, we infused BLT mice with mixtures of isogenic CD4⁺ human
T cells and mouse CD4⁺ T cells from OT-II T-cell receptor (TCR)
transgenic mice. Over the course of 18 h, human T cells were half as
efficient as mouse cells at entering peripheral lymph nodes, whereas
their accumulation in mesenteric lymph nodes was equivalent
(Fig. 1b). Thus, the interspecies molecular interactions between
selectins, chemokine receptors and integrins on human T cells with
their ligands expressed on mouse lymph node high endothelial
venules are collectively functional, albeit at a slightly reduced level.

After T cells enter lymph nodes, their migration is guided by their
sensing of chemokines and interactions with the paracortical
fibroblastic reticulum, which is well preserved in BLT mice (Supplementary
Fig. 1b). We compared the migration of human and mouse
T cells in BLT lymph nodes by multiphoton intravital microscopy
(MP-IVM). Intravenously infused human and mouse cells distributed
similarly in the lymph node cortex, were indistinguishable in various
aspects of motility (Fig. 1c–e, Supplementary Fig. 1c–e and Supplementary
Video 1), and in this regard each resembled mouse T cells in
non-humanized mouse lymph nodes. We concluded that human T-cell
homing to lymph nodes was efficient, and cell migration within the
lymph nodes of BLT mice was, using mouse T cells as a reference,
normal, allowing us to study the effects of HIV infection on this basic
process.
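
The motility measures referred to above (mean 2D-track velocity and arrest coefficient) can be computed from tracked cell positions along the following lines. The definitions used here, including the arrest cut-off of 2 µm min⁻¹, are common conventions assumed for illustration rather than the analysis settings of this study, and the track coordinates are hypothetical.

```python
import math


def step_speeds(track, dt_min):
    """Instantaneous speeds (µm/min) between consecutive (x, y) positions in µm."""
    if len(track) < 2:
        raise ValueError("a track needs at least two positions")
    return [
        math.hypot(x2 - x1, y2 - y1) / dt_min
        for (x1, y1), (x2, y2) in zip(track, track[1:])
    ]


def mean_track_velocity(track, dt_min):
    """Mean of the instantaneous 2D speeds along one track."""
    speeds = step_speeds(track, dt_min)
    return sum(speeds) / len(speeds)


def arrest_coefficient(track, dt_min, arrest_speed=2.0):
    """Fraction of steps with speed below arrest_speed (assumed cut-off, µm/min)."""
    speeds = step_speeds(track, dt_min)
    return sum(1 for s in speeds if s < arrest_speed) / len(speeds)


# Hypothetical track sampled every 0.5 min (positions in µm).
track = [(0.0, 0.0), (3.0, 4.0), (3.0, 4.5), (9.0, 12.5)]
velocity = mean_track_velocity(track, dt_min=0.5)
arrest = arrest_coefficient(track, dt_min=0.5)
print(f"mean velocity = {velocity:.2f} µm/min, arrest coefficient = {arrest:.2f}")
```

In this example the three steps have speeds of 10, 1 and 20 µm min⁻¹, so one step in three counts as arrested.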

To seed lymph nodes with HIV, we injected the footpad of BLT mice
with green fluorescent protein (GFP)-expressing, CCR5-using HIV
(hereafter referred to as HIV-GFP), which we derived by replacing
the V3 loop region of env in the originally CXCR4-tropic
NL4-3-IRES-GFP with that of HIV BaL (Supplementary Fig. 2a, b).
Subcutaneous infection reliably produced high levels of plasma
viraemia (Fig. 2a) and systemic infection (Supplementary Fig. 2c).
However, between two and six days after footpad infection, we
detected GFP⁺ cells in the ipsilateral popliteal lymph node only, and
not in remote lymph nodes, indicating that the infection was initially
contained in the primary draining lymph nodes (Fig. 2b and
Supplementary Fig. 2e). The vast majority of these infected cells were
resting (SSClow), antigen-experienced (CD45RO⁺) T cells with variable
expression of CCR7. By day two, some loss of CD4 cell-surface
expression was apparent, but this was much more pronounced at
day six, when cells had also downregulated major histocompatibility
complex (MHC)-I (Supplementary Fig. 2e).

To investigate the localization of HIV-infected T cells in lymph 
nodes early after footpad infection, we analysed draining lymph nodes 
by MP-IVM two days after virus inoculation. At this time, GFP* cells 
were evenly distributed up to several hundred micrometres away from 
the subcapsular sinus (SCS), indicating that lymph-borne HIV 
arriving in the lymph node SCS has efficient means of infiltrating 
the lymph node cortex (Fig. 2c). In time-lapse recordings it became 
apparent that GFP-expressing lymph node cells migrated at average 
two-dimensional (2D) velocities of ~7 Um min’ and were thus 
robustly motile, suggesting that their motility facilitated the local 
dissemination of HIV infection in lymph nodes (Fig. 2d-f and 
Supplementary Video 2). Because most infected T cells in early simian 
immunodeficiency virus (SIV)-infection of macaques are resting 
memory T cells’*, and the majority of infected lymph node cells in 
our BLT mouse model resembled antigen-experienced T cells (Fig. 2b), 
we compared their motility to that of in vitro-generated, GFP- 
expressing, central memory-like CD4+ human T cells (hereafter 
termed TCMs) that express CD62L and CCR7 and migrate to lymph 
nodes after adoptive transfer (Supplementary Fig. 3a-c). Compared 
with this reference population, the motility of HIV-infected lymph 
node cells was reduced (Fig. 2e, f, Supplementary Fig. 3d, e and 
Supplementary Videos 2 and 3), which may reflect the ability of HIV 
to interfere with cytoskeletal processes involved in cell migration’*’’. 

Although most infected lymph node cells resembled antigen- 
experienced T cells in terms of motility, size and shape, a subpopula- 
tion (10-20%) of cells stood out by their unusually elongated, thin and 

Figure 2 | In vivo dynamics and phenotype of HIV-infected lymph node 
cells. a, Footpad injection of BLT mice with HIV-GFP produces robust and 
sustained viraemia. “HIV” is identical to HIV-GFP but lacks an IRES-GFP 
cassette. Similar results as shown here for five mice were obtained by other 
routes of infection (Supplementary Fig. 2d). Dashed line and grey-shaded area 
indicate mean and 95% confidence interval, respectively, of background 
signals obtained from plasma of uninfected mice. s.c., subcutaneous. 
b, Draining and non-draining lymph node (LN) cells two days after footpad 
infection with HIV-GFP. Grey dot plots and histograms show GFP− SSClow 
lymph node cells. c, An intravital micrograph recorded from a popliteal lymph 
node two days after footpad infection with HIV-GFP. d, Migratory tracks of 
GFP+ lymph node cells during a 30-min recording. e, f, Mean 2D-track 
velocities (e) and arrest coefficients (f) of HIV+ lymph node cells compared 
with uninfected, GFP-expressing TCMs, recorded in lymph nodes of 
uninfected BLT mice. Lines and numbers indicate medians. Data on HIV- 
infected lymph node cells and TCMs are representative of four and two 
independent experiments, respectively. g, MP-IVM time-lapse recordings of 
an HIV-infected lymph node cell (top) and an uninfected TCM (bottom) in 
BLT lymph nodes. Arrows indicate leading and trailing edge of the infected 
cell. Elapsed time in minutes:seconds. h, Instantaneous cell skeletal length of 
HIV-infected lymph node cells and TCMs from recordings as shown in g. Lines 
indicate medians. Percentages indicate events >30 µm, highlighted by dashed 
blue box. i, Representative traces of infected lymph node cells and TCMs 
showing instantaneous cell skeletal length (colour-coded) and instantaneous 
migratory velocity over time. Traces selected from 142 recorded in four 
movies, three independent experiments. 


sometimes branched trailing edges. These cells reached average 
skeletal lengths of at times more than 100 µm, whereas TCMs rarely 
exceeded 20 µm (Fig. 2g, h and Supplementary Videos 2-4). When we 
tracked skeletal lengths in individual cells along with their motility, we 
noted that some of the elongated cells continued to be motile (Fig. 2i). 
Similar results were obtained with SF162R3 (ref. 17), a GFP-expressing 
strain of HIV derived from the primary R5-tropic clone SF162 (Sup- 
plementary Fig. 4 and Supplementary Video 5). 
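
The length measurements above can be illustrated with a short sketch. It follows the definition given in the Methods (cell skeletal length as the longest front-to-end path of an unbranched cell, or the sum of the path lengths of all branches of a branched cell) and the 30 µm cut-off used in the figure legends. The function names and the representation of a cell as lists of (x, y) points in micrometres are assumptions made for illustration only; the original measurements were made in ImageJ.

```python
import math


def path_length(points_um):
    """Length of a polyline given as a list of (x, y) points in micrometres."""
    return sum(math.dist(p, q) for p, q in zip(points_um, points_um[1:]))


def skeletal_length(branches_um):
    """Cell skeletal length: sum of the path lengths of all branches.

    An unbranched cell is passed as a single branch (front-to-end path).
    """
    return sum(path_length(branch) for branch in branches_um)


def fraction_above(lengths_um, threshold_um=30.0):
    """Fraction of instantaneous length measurements above the threshold."""
    if not lengths_um:
        raise ValueError("no measurements")
    return sum(1 for v in lengths_um if v > threshold_um) / len(lengths_um)


# Hypothetical example: one unbranched cell and one cell with a forked trailing edge.
unbranched = [[(0.0, 0.0), (30.0, 40.0)]]
branched = [
    [(0.0, 0.0), (0.0, 10.0)],    # cell body to branch point
    [(0.0, 10.0), (6.0, 18.0)],   # first branch
    [(0.0, 10.0), (-6.0, 18.0)],  # second branch
]
```

Applied frame by frame, `fraction_above` gives the kind of percentage reported for events >30 µm.
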

Because it was possible that our flow cytometry analysis failed to 
detect these elongated cells (Fig. 2b), and that these were in fact not T 
cells, we wanted to determine unambiguously whether HIV-infected T 
cells could acquire such unusual morphological characteristics in vivo. 
Therefore, we adopted a co-culture method" to efficiently infect TCMs 
with HIV-GFP in vitro (Fig. 3a), and transferred these cells into the 
footpads of BLT mice, from where they migrated to draining popliteal 
lymph nodes. Recipient BLT mice were pre-treated with a cocktail of 
antiviral drugs® to prevent infection of any host cells. Under these 
conditions, in which the T-cell identity of GFP+ cells in popliteal 
lymph nodes was known, we observed the same overall reduction 
in motility as well as a sub-population of abnormally elongated 
cells at a similar frequency to after in situ infection (Fig. 3b-d and 
Supplementary Video 6). Co-injection of uninfected TCMs also con- 
firmed that these and HIV-infected T cells localized to the same 
regions of the lymph node and that decreased motility of HIV-infected 
cells did not result from their sequestration into a particular micro- 
environment (Supplementary Fig. 5 and Supplementary Video 6). 

Figure 3 | HIV induces an elongated phenotype in infected T cells. 
a, Analysis of in vitro HIV-infected TCMs in BLT lymph nodes. FTC, 
emtricitabine; TDF, tenofovir disoproxil fumarate. b, MP-IVM time-lapse 
series of uninfected (red) and HIV+ (green) TCMs in a BLT lymph node. 
Elapsed time in minutes:seconds. c, Mean 2D-track velocities. Red lines and 
numbers indicate medians. d, Correlation of mean cell skeletal length and arrest 
coefficient of individual cells. Dashed lines indicate threshold values based on 
measurements of uninfected TCMs. Data pooled from two independent 
experiments. e, Production of an R5-tropic, GFP-expressing lentiviral vector. 
egfp, enhanced GFP; int, integrase; LTR, long terminal repeat; RT, reverse 
transcriptase. f, Forty-eight hours after footpad injection of 1.4 X 10° infectious 
units, GFP+ cells are found in draining lymph nodes. g, h, These cells do not 
elongate (g) and do not show the reduction of cell motility of HIV-infected 
lymph node cells (h). Percentages in g refer to cells >30 µm, highlighted by 
dashed blue box. The red lines and numbers in h indicate medians. Data are 
representative of two independent experiments per three mice. 

The elongated morphology could result from the infection of a 
T-cell population that has an intrinsic propensity to adopt such shapes. 
Alternatively, HIV proteins might actively induce this phenotype. To 
distinguish between these two possibilities, we generated a CCR5- 
using, HIV-derived lentiviral vector that expresses GFP but no HIV 
proteins in infected cells (Fig. 3e). Two days after footpad injection, a 
similar number of GFP+ cells was observed in draining lymph nodes as 
after infection with HIV-GFP. However, these cells did not exhibit 
elongated shapes. They also migrated faster than HIV-GFP-infected 
lymph node cells and at similar speeds to TCMs (Fig. 3f-h and 
Supplementary Video 7), further supporting the idea that HIV induces 
migratory deceleration of infected cells. Thus, one or several HIV 
proteins induce the abnormal shape and decreased motility of HIV- 
infected T cells. 

During HIV assembly, which in migrating, polarized T cells takes 
place at the uropod”’, the HIV Env glycoprotein is exposed on the cell 
surface and can trigger the formation of intercellular adhesive contacts 
with CD4-expressing cells in vitro*’’. In the lymph node environ- 
ment, in which T cells are highly motile, such adhesive interactions 
could lead to abnormal uropod elongation through tethering of 
infected T cells to uninfected CD4+ lymph node cells. Supporting this 
model, we consistently observed stationary behaviour of the trailing 
edges of migrating T cells, often followed by their rapid retraction, 
perhaps reflecting sudden release of tethered uropods (Supplementary 
Video 8). Alternatively, cellular elongations could also result from the 
formation of multinucleated syncytia, which also depends on the func- 
tion of Env. To test whether either uropod tethering or syncytia 
formation are involved in the elongation of HIV-infected T cells, we 
generated HIV-GFP(ΔEnv), in which large parts of Env, including the 
CD4-binding region, are not expressed (Supplementary Fig. 6a). 
When trans-complemented with an exogenous R5 HIV envelope, 
the virus was as infectious as HIV-GFP, but no secondary infections 
occurred, as expected, because infected cells expressed no functional 
Env protein (Supplementary Fig. 6b-d). Notably, the elongated 
phenotype of infected lymph node cells was completely lost in the 
absence of functional Env (Fig. 4a-c and Supplementary Video 9). 
However, cells infected with HIV-GFP(ΔEnv) still migrated at reduced 
speeds (Fig. 4b, c), suggesting that HIV factors other than Env were 
largely responsible for the decreased T-cell motility. Cell elongation 
did not result from artificial interactions of Env with mouse adhesion 
or extracellular matrix proteins, as it was not observed in HIV-infected 
TCMs migrating in the lymph nodes of non-humanized mice (Fig. 4d 
and Supplementary Fig. 7). To test whether Env interactions with CD4 
specifically, or with other human ligands such as the α4β7-integrin”, 
are involved in tethering, we mutated HIV-GFP to exchange a single 
crucial amino acid in Env (D368R in the HXB2 consensus sequence) to 
abrogate CD4-binding”* (Supplementary Fig. 8). Cells infected with 
HIV-GFP(D368R) showed the same loss of cell elongation as observed 
after Env deletion (Fig. 4d). 

To examine whether some of the highly elongated T cells are part of 
multinucleated syncytia, we analysed infected BLT lymph nodes by 
histology. However, although we did find GFP-expressing cells with 
several nuclei (not shown), we were unable to visualize the elaborate 
tethers observed by MP-IVM in thick tissue sections, which may 
explain in part why these presumably highly labile structures have 
not been observed in human histological tissue samples. Therefore, 
we generated HIV-nGFP, in which GFP is targeted to cell nuclei 
through the addition of a nuclear localization signal (Supplementary 
Fig. 9). When we infected BLT mice with this strain, we observed by 
MP-IVM that the vast majority of elongated lymph node cells were 
multinucleated syncytia that, given the involvement of Env, probably 
developed through cell fusion (Fig. 4e, f). These syncytia frequently 
migrated with their tightly clustered nuclei moving in a coordinated 
fashion, but intermittently dispersed so that individual nuclei moved in 
different directions, yet remained connected by long membrane tethers 
(Supplementary Video 10). However, both fused and unfused cells also 

Figure 4 | HIV-infected T cells tether to other lymph node cells and form 
syncytia through Env, and migrate to distant tissues to disseminate 
infection. a, Intravital micrographs from lymph nodes of BLT mice injected 
48 h earlier with HIV-GFP (left image) in one footpad, and with HIV- 
GFP(ΔEnv) (right) in the other footpad. b, Representative traces of infected T 
cells depicting instantaneous cell skeletal length (colour-coded) and migratory 
velocity over time. Traces selected from 281 recorded in 14 movies, 3 
independent experiments. c, Mean cell skeletal length and arrest coefficient of 
individual cells infected with HIV-GFP or HIV-GFP(ΔEnv). Data are pooled 
from six or eight recordings, respectively, from three independent experiments. 
d, Cell lengths of HIV-GFP-infected TCMs injected into natural killer cell- 
depleted, antibody-deficient DHLMP2A mice, and of BLT lymph node cells 
infected in situ with HIV-GFP(D368R) or HIV-GFP. e, Recording of lymph 
node cells infected with HIV-nGFP. Bottom panels indicate border between 
cytoplasm and nuclei, based on an 80% maximum fluorescence intensity (FI) 
threshold. f, Cell lengths of mono- and multinucleated, HIV-infected cells. 
g, h, Viraemia in NSG BLT mice treated with FTY720 or vehicle starting at the 
day of s.c. HIV infection via the cheek skin. g, Infection with HIV-GFP. 
h, Infection with clone SF162R3. FTY720 treatment was ended at day 56. 
i, Plasma viraemia under FTY720 treatment started four weeks after 
intraperitoneal (i.p.) infection, when viraemia had stabilized. One 
representative experiment of two is shown. 


formed free-ending tethers, indicating that those were caused by 
adhesive interactions with non-visualized CD4+ cells (Supplementary 
Video 11). Although similar membrane tethers, such as 
nanotubes, can also form between T cells in vitro in the absence of 
viral infection’, Env proteins of retroviruses such as HIV have been 
shown to facilitate their formation’. Adhesive tethers may promote 
cell-to-cell viral transfer through virological synapses, or may reflect 
the initiation of cell fusion, or both. 

Movement of free HIV virions within and between tissues is 
probably restricted by the limits of diffusion, by anatomical barriers, 
and by soluble immune factors, such as complement and natural 
antibody’. Allowing infected T cells to retain robust motility and to 
serve as migratory vehicles might therefore be a viral strategy for 
efficient local and systemic dissemination. If long-range transport of 
HIV to distant tissues does rely on the trafficking of infected cells 
during physiological T-cell recirculation, then interfering with 
lymphocyte trafficking should limit systemic viral dissemination and 
reduce plasma viraemia. 

To block T-cell egress from lymph nodes into efferent lymph 
vessels, we treated BLT mice with the functional sphingosine 
1-phosphate receptor antagonist FTY720 (fingolimod), which resulted 
in profound lymphopenia (Supplementary Fig. 10a). Notably, in mice 
that received FTY720 at the time of infection with HIV-GFP, viral 
RNA in peripheral blood remained within or only slightly above the 
background range (Fig. 4g), whereas the drug did not show antiviral 
activity in cell culture (Supplementary Fig. 10b). This result was con- 
firmed with another R5-tropic strain, SF162R3 (ref. 17; Fig. 4h). Two 
months after infection, viral RNA was strongly reduced, yet not absent 
from remote lymphoid tissues of treated animals (Supplementary 
Fig. 10c), suggesting that FTY720 in the longer term did not fully 
prevent, but only limited dissemination within the lymphoid system. 
Accordingly, the development of viraemia after FTY720 withdrawal 
following 8 weeks of treatment indicated that the release of T cells from 
lymphoid tissues rapidly initiated efficient systemic viral replication 
(Fig. 4h). Interestingly, we also observed a reduction in viral load in the 
draining lymph node in long-term-treated animals (Supplementary 
Fig. 10c). This may reflect local depletion of HIV T-cell targets, but 
could also indicate that FTY720 treatment reduces viral replication 
through a general perturbation of T-cell homeostasis or through 
delayed, direct antiviral activity, independently of its effect on the 
trafficking of HIV-infected T cells. However, FTY720 did not affect 
viraemia in mice with pre-established infection (Fig. 4i), similar to 
what was observed in simian-human immunodeficiency virus 
(SHIV) infection of macaques”. Although we cannot rule out other 
direct or indirect effects of FTY720 on viral replication, the observation 
that HIV infection is limited and high-level viraemia prevented by 
interfering with T-cell recirculation before, but not after, systemic viral 
dissemination, suggests an important role for the trafficking of 
infected, migratory T cells in the efficient transport of HIV from 
draining to remote lymphoid tissues. 

In summary, we find that HIV induces a partial reduction in the 
motility of infected T cells, as well as their tethering to and fusion with 
CD4+ immune cells in lymph nodes. This may reflect the need for HIV 
to tune T-cell migratory and interactive behaviour to meet two oppos- 
ing requirements: the use of motile cells as vehicles to disseminate 
within and between tissues, which inherently involves short-lived cell 
contacts; and the formation of virological synapses and, possibly, 
syncytia, for cell-to-cell spread, which probably requires longer-lasting 
cell contacts. Reduced motility of infected T cells might favour the 
formation and maintenance of tethering contacts with potential HIV 
target cells, while still allowing for efficient trafficking to distant tissues. 


METHODS SUMMARY 

BLT humanized mice. Sublethally irradiated, female NOD/SCID mice were 
transplanted under the kidney capsule with 1 mm³ fragments of human fetal liver 
and thymus, and injected intravenously with 1 X 10°-5 X 10° purified CD34+ 
human fetal liver cells. After 18-20 weeks, healthy mice with >50% human 
lymphoid reconstitution in peripheral blood were used for experiments within 
6 weeks. In experiments that did not require popliteal lymph nodes, NOD/SCID × 
γc−/− (NSG; γc is also known as Il2rg) mice were used as graft recipients. 

HIV reporter strains. The CCR5-using, GFP-expressing virus NL4-3 IRES-GFP 
R5 (HIV-GFP) was constructed from pBR-NL4-3-IRES-GFP-nef+ (ref. 12) by 
replacing the V3 loop region of env with the corresponding sequence from the R5 
clone BaL. HIV-GFP(ΔEnv) and HIV-GFP(D368R) were generated by site-directed 
mutagenesis. To produce infectious HIV-GFP(ΔEnv) and HIV-GFP(D368R), the 
BaL Env was trans-complemented using a helper plasmid during virus production. 
HIV-nGFP was generated by inserting the SV40 nuclear localization sequence 
amino-terminal of GFP in HIV-GFP. The lentiviral vector pHAGE-CMV-GFP 
was packaged with the R5 Env and standard helper plasmids to produce the 
CCR5-using ‘R5 LV-GFP’, which expresses no HIV proteins in infected cells. 

MP-IVM of HIV-infected cells in lymph nodes. For in situ infection of lymph 
node cells, BLT mice were injected into the footpad with 0.5 X 10°-1.5 X 10° 
infectious units of recombinant HIV or R5 LV-GFP. Forty-eight hours later, the 
draining popliteal lymph nodes were imaged by MP-IVM, as described’. 
Alternatively, HIV-GFP-infected TCMs were injected into the footpads of BLT 
mice pretreated with antiretroviral drugs, and imaged by MP-IVM 12h later. 


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS 

BLT humanized mice. Female NOD/SCID (NS) and NOD/SCID × γc−/− (NSG; 
γc is also known as Il2rg) mice were purchased from Jackson Laboratories and 
reconstituted at age 6-8 weeks with human immune systems as previously 
described*’. In brief, mice were conditioned with sublethal (2 Gy) whole-body 
irradiation and transplanted with 1 mm³ fragments of human fetal thymus and 
liver (17-19 weeks of gestational age, from Advanced Bioscience Resources) under 
the kidney capsule. CD34+ cells were isolated from human fetal liver using anti- 
CD34 microbeads (Miltenyi Biotec) and injected intravenously (1 10°-5 X 10° 
cells per mouse) within 6 h of surgery to create BLT mice. Mice were monitored for 
human haematopoietic reconstitution at 14 and 18 weeks after transplantation. 
High lymphoid reconstitution in peripheral blood (>25% lymphocytes in PBL, 
>50% human lymphocytes, >40% T cells among lymphocytes) correlated 
with optimal lymph node reconstitution suitable for intravital imaging (typically 
at 18-20 weeks). 

NS mice were used for all experiments involving footpad injection of HIV or 
cells and analysis (for example, by MP-IVM) of draining popliteal lymph nodes. 
NSG mice, which lack popliteal but possess cervical lymph nodes, were used for 
long-term experiments involving FTY720 treatment. OT-II TCR transgenic mice 
were obtained from Jackson Laboratories and bred in house. 

All experiments were performed in accordance with the rules and regulations of 
the Center of Comparative Medicine and approved by the MGH Institutional 
Animal Care and Use Committee and the Harvard Medical Area Standing 
Committee on Animals. 

Plasmids and virus production. The proviral plasmid pBR-NL43-IRES-EGFP-nef+ 
(pleG-nef+)*! was obtained from the NIH AIDS Research & Reference 
Reagent Program (ARRRP). The V3 loop of env was modified to resemble that 
of HIV BaL and confer R5-tropism, as previously described”. In brief, a commercial 
kit (Stratagene) and primers 5’-CCCAACAACAATACAAGAAAAAGTATAC 
ATATAGGACCAGGCAGAGCATTATATACAACAGGAGAAATAATAGGA 
GATATAAGACAAGCACATTGTAACATTAG-3’ and 5'-CTAATGTTACAA 
TGTGCTTGTCTTATATCTCCTATTATTTCTCCTGTTGTATATAATGCTC 
TGCCTGGTCCTATATGTATACTTTTTCTTGTATTGTTGTTGGG-3’ were 
used according to the manufacturer's protocol to mutagenize pleG-nef+ and 
obtain pleG-nef+ R5. The non-GFP-expressing parental plasmid, pNL4-3, was 
similarly mutagenized to obtain pNL4-3 R5. Infectious virus generated from 
pleG-nef+ R5 and pNL4-3 R5 are here termed HIV-GFP and HIV, respectively. 

The pSVIIIexE7-ADA env-expressing vector (from J. Sodroski) was modified 
by restriction cloning to express env from pleG-nef* R5 using KpnI and BamHI 
restriction sites to obtain pSVIIIexE7-BaL. 

To construct an env-deleted reporter construct, pleG-nef+ R5 was digested with 
PsiI (nucleotides 6865 and 7599) and religated. The resulting frameshift mutation 
introduced a premature stop codon while the RRE remained intact. The new 
plasmid pleG-nef+ Δenv was packaged with pSVIIIexE7-BaL to obtain HIV- 
GFP(ΔEnv). 

To construct the CD4-binding-deficient reporter virus HIV-GFP(D368R), 
site-directed mutagenesis of pleG-nef* R5 was performed using primers 
5'-GCAATCCTCAGGAGGGCGCCCAGAAATTGTAACG-3’ and 5’-CGTTA 
CAATTTCTGGGCGCCCTCCTGAGGATTGC-3’ (Stratagene). Successful 
mutagenesis was confirmed by DNA sequencing. HIV-GFP(D368R) was packaged 
with pSVIIIexE7-BaL to obtain single-round infectious viral supernatant. 
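
As a quick consistency check on the D368R mutagenesis primers quoted above, the sketch below confirms that the second primer is the exact reverse complement of the first, as expected for a complementary primer pair. The helper name `reverse_complement` and the variable names are illustrative and not taken from the paper; the two sequences are copied from the text.

```python
# Map each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# D368R mutagenesis primers as given in the Methods (5' to 3').
FORWARD = "GCAATCCTCAGGAGGGCGCCCAGAAATTGTAACG"
REVERSE = "CGTTACAATTTCTGGGCGCCCTCCTGAGGATTGC"

primers_are_complementary = reverse_complement(FORWARD) == REVERSE
```
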

The lentiviral vector construct pHAGE-CMV-GFP is a modification of a con- 
struct originally developed in the laboratory of R. C. Mulligan*. pHAGE-CMV- 
GFP was packaged with standard helper plasmids and pSVIIIexE7-BaL to obtain 
the non-replicating, R5-using vector R5 LV-GFP. To construct the vector 
pHAGE-NLS-Cerulean, first GFP was replaced in pHAGE-CMV-GFP with 
Cerulean using restriction enzyme sites NcoI and XmaI to obtain pHAGE- 
CMV-Cerulean. Then, the complementary primers 5’-AGCCCCATGGCT 
CCAAAAAAGAAGAGAAAGGTAGGCGCCATGGCT-3’ and 5'-AGCCATG 
GCGCCTACCTTTCTCTTCTTTTTTGGAGCCATGGGCT-3’ were annealed, 
digested with NcoI and inserted into pHAGE-CMV-Cerulean, placing the nuclear 
localization signal (NLS) peptide of SV40 immediately upstream of GFP. The same 
cloning strategy was used to construct HIV-nGFP from HIV-GFP. pSF162R3 was 
provided by A. Brown. 
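
The coding content of the NLS oligonucleotide can be checked with a small sketch. Assuming the insert is read in frame with its first ATG (an assumption; the text states only that the annealed primers place the SV40 NLS peptide upstream of the fluorescent protein), translating the forward primer yields a peptide containing PKKKRKV, the SV40 large T antigen nuclear localization signal. The codon table below is deliberately partial and covers only the codons present in this primer; all names are illustrative.

```python
# Partial codon table: only the codons that occur in the NLS primer.
CODONS = {
    "ATG": "M", "GCT": "A", "GCC": "A", "CCA": "P",
    "AAA": "K", "AAG": "K", "AGA": "R", "GTA": "V", "GGC": "G",
}

# Forward NLS primer as given in the Methods (5' to 3').
NLS_FORWARD = "AGCCCCATGGCTCCAAAAAAGAAGAGAAAGGTAGGCGCCATGGCT"


def translate_from_first_atg(seq):
    """Translate complete codons starting at the first ATG; unknown codons give 'X'."""
    start = seq.find("ATG")
    if start < 0:
        raise ValueError("no ATG found")
    codons = [seq[i:i + 3] for i in range(start, len(seq) - 2, 3)]
    return "".join(CODONS.get(codon, "X") for codon in codons)


nls_peptide = translate_from_first_atg(NLS_FORWARD)
```
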

All HIV and lentiviral stocks were produced by transfecting HEK293T cells 
using Lipofectamine 2000 (Invitrogen). Viral supernatants were clarified through 
a 0.22 µm filter and centrifuged at 90,000g for 2 h using an SW28Ti rotor (Beckman 
Coulter) over a 20% sucrose cushion. Viral stocks were titrated using MAGI.CCR5 
cells and expressed as blue focus units per ml, as described previously™. 

Cells. Human and mouse CD4+ T cells were purified from uninfected BLT and 
OT-II mice, respectively, by negative immunomagnetic selection (Miltenyi 
Biotec), and labelled with 2 µM CellTracker green (CMFDA) or 10 µM 
CellTracker blue (CMAC, both from Invitrogen) before adoptive transfer into 
BLT mice (see below). 

Human CD4+ central memory-like T cells (TCMs) were generated as described 
previously with slight modifications®*. In brief, CD4* T cells were isolated from 
spleen and lymph nodes of BLT mice by positive immunomagnetic selection using 
CD4 microbeads (Miltenyi Biotec) to purities consistently above 95%. Cells were 
activated by adding Dynabeads coated with anti-human CD3ε/CD28 antibody 
(3:1 bead:cell ratio, Invitrogen) for 2 days in RPMI 1640 supplemented with 10% 
FCS (Atlanta Biologicals), 2 mM L-glutamine (Gibco), 1 mM sodium pyruvate and 
10mM HEPES (Mediatech). After 2 days, cells were washed and cultured for 
another 6-8 days in medium containing 50 IU ml−1 human rIL-2 (R&D 
Systems), keeping cell density close to 5 X 10° cells ml−1. Cells were used for all 
experiments between days 8 and 10. 

To infect TCMs with HIV-GFP with high efficiency, we used a modified 
co-culture method’. In brief, 2 10° MAGI.CCR5 cells were transfected with 
10 µg of pleG-nef+ R5, and 24 h later T cells were added to the confluent layer of 
virus-producing cells. Two days later, T cells were collected by Ficoll gradient 
centrifugation and washed three times before injection into BLT mice. Because 
we could generate only a limiting number of HIV-infected TCMs by this approach, 
we injected the cells into footpads for initially selective and efficient delivery to 
draining lymph nodes”. After intravenous (i.v.) injection, cells distributed to all 
secondary lymphoid tissue and accumulated in only smaller numbers in the 
popliteal lymph node under study. In some studies, HIV-infected TCMs were 
injected into the footpad of DHLMP2A mice** depleted of natural killer cells through 
daily i.p. injections of 50 µl anti-asialo GM1 antibody (Wako Chemicals) for 2 days. 

To generate TCMs that were not infected with HIV-GFP, but expressed GFP for 
visualization by MP-IVM and comparison with HIV-GFP-infected cells, we trans- 
duced human T cells with vesicular stomatitis virus protein G (VSV-G) pseudo- 
typed pHAGE-CMV-GFP on day 2 after activation. In some experiments, 
uninfected TCMs were stained for 15 min at 37°C with 10 µM CellTracker 
Orange (CMTMR, Invitrogen) before footpad or i.v. injection into BLT mice. 

MAGI.CCR5, U87.CD4, U87.CD4.CCR5 and U87.CD4.CXCR4 cells were all 
obtained from the ARRRP. 

T-cell homing studies. Isogenic human and mouse OT-II CD4* T cells were 
purified, differentially labelled with cell-tracker dyes, and injected i.v. into BLT 
mice. Eighteen hours later, flow cytometry was used to determine homing ratios 
based on the relative frequencies of human and mouse T cells injected and 
recovered in various tissues. 
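
The homing comparison can be sketched as follows. The ratio-of-ratios form (recovered human:mouse ratio divided by injected human:mouse ratio) is one reading of "relative frequencies of human and mouse T cells injected and recovered"; it is an assumption, as are the function names. A value of 1 then corresponds to identical homing efficiency, which is the hypothetical value the one-sample t-test in the Statistical analysis section compares against. Only the t statistic is computed here; converting it to a P value requires the t distribution with n − 1 degrees of freedom.

```python
import math
import statistics


def homing_ratio(injected_human, injected_mouse, recovered_human, recovered_mouse):
    """Recovered human:mouse ratio normalized to the injected human:mouse ratio."""
    input_ratio = injected_human / injected_mouse
    output_ratio = recovered_human / recovered_mouse
    return output_ratio / input_ratio


def one_sample_t(values, mu=1.0):
    """t statistic for testing whether the mean of `values` differs from `mu`."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two values")
    sem = statistics.stdev(values) / math.sqrt(n)
    return (statistics.mean(values) - mu) / sem
```

For example, if equal numbers of human and mouse cells are injected and twice as many mouse as human cells are recovered from a tissue, `homing_ratio` returns 0.5.
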

HIV infection of BLT mice. For analyses of HIV infection in popliteal lymph 
nodes, BLT mice were injected into footpads with 5 X 10*-1.5 X 10° infectious 
units of various HIV strains. 

For longer-term studies on the effects of FTY720 treatment on the course of 
infection, 4 X 10* infectious units of HIV-GFP were injected into the cheek skin of 
NSG-derived BLT mice, which is drained by cervical lymph nodes. 

Plasma viraemia. Mice were bled from the superficial temporal vein at indicated 
times and sera stored at −80 °C. Viral RNA was isolated from 20-70 µl of plasma 
using the QIAamp viral RNA kit (Qiagen), reverse transcribed, and complementary 
DNA amplified by quantitative PCR using SYBR green (Qiagen) on the Lightcycler 
480-II (Roche). Plasma from uninfected BLT mice was used to determine the 
reverse transcriptase PCR (RT-PCR) background signal, which was consistently 
higher than the background signal obtained with water. 

Quantification of tissue viral load. Mice were euthanized at the specified days 
and lymph nodes and spleen were removed and stored in 200 µl of RNAlater 
solution overnight (Ambion). Single-cell suspensions from tissues were obtained 
using the gentleMACS dissociator (Miltenyi Biotec) and RNA was extracted using 
a commercial kit (Qiagen). Quantification of viral RNA was performed as 
described above. Viral loads were normalized to total tissue RNA. 

Pharmacological treatments. FTY720 (1 mg kg−1 body weight) or vehicle was 
injected i.p. every other day starting 1 h before HIV infection of BLT mice to block 
T cell egress from lymph nodes”. 

FTC (100 mg kg−1) and TDF (150 mg kg−1) were injected i.p. daily starting 
2 days before footpad injection of HIV-infected TCMs to prevent infection of 
recipient BLT mice. 

Intravital multiphoton microscopy and image analysis. BLT mice were 
anaesthetized and the popliteal lymph nodes microsurgically exposed for MP- 
IVM as previously described’’. Imaging depth was typically 80-200 µm below 
the lymph node capsule. For multiphoton excitation and second harmonic 
generation, a MaiTai Ti:sapphire laser (Newport/Spectra-Physics) was tuned to 
between 830 and 920 nm for optimized excitation of the fluorescent probes used. 
For 4D recordings of cell migration, stacks of 11 optical sections (512 × 512 pixels) 
with 4-µm z-spacing were acquired on an Ultima IV multiphoton microscope 
(Prairie Technologies) every 15 or 24 s to provide imaging volumes of 40 µm in 
depth. Emitted light and second harmonic signals were detected through 455/50 nm, 
525/50 nm, 590/50 nm and 665/65 nm band-pass filters with non-descanned 
detectors. 

Data sets were transformed in Imaris 7.3.1 (Bitplane) to generate maximum 
intensity projections for export as Quicktime movies. Because automated 3D- 
tracking of cell migration through Imaris did not perform well on the complex 
shapes of elongated HIV-infected cells, we used automated or manual 2D-tracking 
of cell centroids for all motility analyses, which yielded slightly lower velocity 
measurements than 3D-tracking. 2D cell lengths were measured in ImageJ as 
the longest path connecting front and end of unbranched cells or as the sum of 
path lengths of all branches of individual branched cells (‘cell skeletal length’). 
Either instantaneous (frame-by-frame) cell lengths or average cell lengths for 
individual cell tracks are provided. Cell step and track parameters were further 
analysed in Matlab (Mathworks). 
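
The track parameters reported in the figures can be illustrated with a minimal sketch that works on 2D centroid positions sampled at a fixed frame interval (15 or 24 s in these recordings). The arrest coefficient is not defined in the text; the sketch assumes the common convention of the fraction of steps in which instantaneous velocity falls below a threshold, and the 2 µm min−1 default is likewise an assumption rather than a value from the paper. The original analysis was done in Imaris and Matlab; the function name here is hypothetical.

```python
import math


def track_metrics(centroids_um, frame_interval_s, arrest_threshold_um_per_min=2.0):
    """Return (mean 2D track velocity in um/min, arrest coefficient) for one track.

    centroids_um: list of (x, y) centroid positions in micrometres, one per frame.
    """
    if len(centroids_um) < 2:
        raise ValueError("a track needs at least two positions")
    dt_min = frame_interval_s / 60.0
    # Instantaneous speed for each step between consecutive frames.
    speeds = [
        math.dist(p, q) / dt_min
        for p, q in zip(centroids_um, centroids_um[1:])
    ]
    mean_velocity = sum(speeds) / len(speeds)
    # Arrest coefficient: fraction of steps slower than the threshold.
    arrested = sum(1 for s in speeds if s < arrest_threshold_um_per_min)
    return mean_velocity, arrested / len(speeds)


# Hypothetical track: moves 5 um, pauses for one frame, moves 5 um again.
example_track = [(0.0, 0.0), (3.0, 4.0), (3.0, 4.0), (6.0, 8.0)]
example_velocity, example_arrest = track_metrics(example_track, frame_interval_s=15)
```

Because only centroid displacement in the imaging plane is used, such values are expected to be somewhat lower than 3D measurements, as noted above.
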

All time-lapse sequences are accelerated 360X over real-time for display as 
Quicktime movies. 

Flow cytometry. Phenotypic characterization of T cells was performed on an 
LSRII (Becton Dickinson), using FlowJo software (Tree Star) for analysis. 
Spleen and lymph nodes were minced with fine forceps and passed through a 
40-µm mesh to obtain single-cell suspensions. Cells were washed, counted and 
stained with a panel of directly conjugated anti-human monoclonal antibodies: 
CD3-APC/Cy7 (HIT3a), CD4-PE/Cy7 (OKT4), CCR7-APC (TG8), CD8-PerCP 
(RPA-T8), CD45RA-PE (HI100), CD45RO-PE (UCHL1), MHC-I-PerCP 
(W6/32), CCR5-PE (HEK/1/85a) and CD62L-Alexa488 (DREG-56) (Biolegend). 

HIV-infected cells were fixed with 2% paraformaldehyde after staining and before 
analysis. 

Immunohistochemistry. Frozen sections of BLT and C57BL/6 lymph nodes 
previously fixed in paraformaldehyde were processed for histological analysis 
using standard techniques. In brief, 20 um frozen sections were permeabilized 
with Triton X-100, blocked with BSA and anti-CD16/32, and stained with 
polyclonal anti-desmin rabbit IgG (ab15200; Abcam) and either anti- 
human CD4 mouse IgG1 (clone RPA-T4, Biolegend) or FITC-anti-mouse CD4 
rat IgG2b (clone GK1.5, Biolegend) antibodies. Alexa Fluor 555-anti-rabbit IgG 
(A31572, Invitrogen) and FITC-anti mouse IgG (ab97022, Abcam) were used as 
secondary antibodies. Stained sections were embedded in VectaShield Hard set 
(Vector H-1400) and analysed by multiphoton microscopy. 

Statistical analysis. An unpaired Student’s t-test or Mann-Whitney U test was 
used for comparisons of data sets with normal or non-normal distribution, 
respectively, using Prism 5 (GraphPad). For the homing studies, one-sample t-test 
was used for comparison to a hypothetical value of 1, corresponding to identical 
homing efficiency. When P values were smaller than 0.05, differences were con- 
sidered as significant. 
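
For the non-parametric branch of these comparisons, the Mann-Whitney U statistic can be sketched in a few lines. This is an illustration only: the analyses in the paper were run in Prism 5, whereas the two-sided P value below uses the normal approximation with a tie correction and no continuity correction, so it will differ from exact P values for small samples. The function name is hypothetical.

```python
import math


def mann_whitney_u(x, y):
    """Return (U, two-sided P) using the tie-corrected normal approximation."""
    n1, n2 = len(x), len(y)
    if n1 == 0 or n2 == 0:
        raise ValueError("both samples must be non-empty")
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    n = n1 + n2
    rank_sum_x = 0.0
    tie_term = 0
    i = 0
    while i < n:
        # Find the block of tied values starting at position i.
        j = i
        while j < n and pooled[j][0] == pooled[i][0]:
            j += 1
        avg_rank = (i + 1 + j) / 2.0  # mean of ranks i+1 .. j
        rank_sum_x += avg_rank * sum(1 for k in range(i, j) if pooled[k][1] == 0)
        t = j - i
        tie_term += t ** 3 - t
        i = j
    u1 = rank_sum_x - n1 * (n1 + 1) / 2.0
    u = min(u1, n1 * n2 - u1)
    var_u = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1)))
    if var_u == 0:
        return u, 1.0  # all values identical: no evidence of a difference
    z = (u - n1 * n2 / 2.0) / math.sqrt(var_u)
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return u, p
```

With the 0.05 threshold stated above, a difference would be called significant when the returned P value is below 0.05.
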

Caspase-11 increases susceptibility to Salmonella 
infection in the absence of caspase-1 


Petr Broz', Thomas Ruby’, Kamila Belhocine', Donna M. Bouley”, Nobuhiko Kayagaki’, Vishva M. Dixit? & Denise M. Monack' 


Inflammasomes are cytosolic multiprotein complexes assembled 
by intracellular nucleotide-binding oligomerization domain 
(NOD)-like receptors (NLRs) and they initiate innate immune 
responses to invading pathogens and danger signals by activating 
caspase-1 (ref. 1). Caspase-1 activation leads to the maturation and 
release of the pro-inflammatory cytokines interleukin (IL)-1β and 
IL-18, as well as lytic inflammatory cell death known as pyroptosis. 
Recently, a new non-canonical inflammasome was described that 
activates caspase-11, a pro-inflammatory caspase required for 
lipopolysaccharide-induced lethality. This study also highlighted 
that previously generated caspase-1 knockout mice lack a functional 
allele of Casp11 (also known as Casp4), making them 
functionally Casp1 Casp11 double knockouts. Previous studies 
have shown that these mice are more susceptible to infections with 
microbial pathogens, including the bacterial pathogen Salmonella 
enterica serovar Typhimurium (S. typhimurium), but the individual 
contributions of caspase-1 and caspase-11 to this phenotype 
are not known. Here we show that non-canonical caspase-11 activa- 
tion contributes to macrophage death during S. typhimurium 
infection. Toll-like receptor 4 (TLR4)-dependent and TIR-domain- 
containing adaptor-inducing interferon-β (TRIF)-dependent 
interferon-β production is crucial for caspase-11 activation in 
macrophages, but is only partially required for pro-caspase-11 
expression, consistent with the existence of an interferon-inducible 
activator of caspase-11. Furthermore, Casp1-/- mice were significantly 
more susceptible to infection with S. typhimurium than mice 
lacking both pro-inflammatory caspases (Casp1-/- Casp11-/-). 
This phenotype was accompanied by higher bacterial counts, the 
formation of extracellular bacterial microcolonies in the infected 
tissue and a defect in neutrophil-mediated clearance. These results 
indicate that caspase-11-dependent cell death is detrimental to the 
host in the absence of caspase-1-mediated innate immunity, result- 
ing in extracellular replication of a facultative intracellular bacterial 
pathogen. 

Previous studies have shown that logarithmic phase S. typhimurium 
induce a rapid NLRC4-dependent cell death in cultured macrophages 
that requires the type 3 secretion system (T3SS) encoded by Salmonella 
pathogenicity island 1 (SPI-1). We previously reported that 
Salmonella grown to stationary phase (decreased SPI-1 expression) 
do not induce rapid NLRC4 activation, but establish themselves in 
an intracellular niche. Intracellular Salmonella are detected by the 
inflammasome receptors NLRP3 and NLRC4, and mature IL-1β and 
IL-18 are released 12-17 h after infection (Supplementary Figs 1 and 
2a, b). In addition, intracellular Salmonella induce an uncharacterized 
form of lytic cell death that is independent of the SPI-1 T3SS (ref. 11). 
To investigate host factors involved in this type of Salmonella- 
induced macrophage death, we infected macrophages deficient for 
specific inflammasome components with stationary phase wild-type 
Salmonella. Macrophage death, which required infection with live 
bacteria, did not require NLRP3 or the adaptor protein apoptosis- 
associated speck-like protein containing a CARD (ASC), but was 
partially dependent on NLRC4 (Supplementary Figs 2a and 3). 
Because macrophages from Casp1-/- Casp11-/- mice did not release 
lactate dehydrogenase (LDH), we investigated whether caspase-1 and 
caspase-11 were activated and processed in response to intracellular 
Salmonella (Supplementary Fig. 2a). Processed caspase-1 p20 and 
caspase-11 p30 subunits were detected during Salmonella infection, 
indicating that caspase-11 activation correlated with cell death 
(Supplementary Fig. 2a). Consistently, Casp11-/- macrophages were 
significantly more resistant to death than wild-type macrophages 
(Fig. 1a), demonstrating an important role for caspase-11 in cell death 
caused by intracellular Salmonella. In contrast to Legionella pneumophila 
infections, intracellular growth of Salmonella in wild-type and 
Casp11-/- macrophages was not significantly different (data not 
shown). 

Macrophage death was not totally abrogated in Casp11-/- macrophages 
infected with wild-type Salmonella, indicative of another cell 
death pathway (Fig. 1a and Supplementary Fig. 2c). Because NLRC4 
contributed to macrophage death (Fig. 1a and Supplementary Fig. 2a), 
we determined whether NLRC4 activation accounted for the remaining 
cell death seen in Casp11-/- macrophages. The second T3SS 
(SPI-2), which is used by intracellular Salmonella to inject effector 
proteins into the host cell, can also inject flagellin, a ligand for 
NLRC4 (refs 13-16). Comparable levels of cell death were observed 
in Casp1-/- Casp11-/- and Casp11-/- macrophages infected with a 
ΔSPI-2 or a flagellin-deficient strain of Salmonella (Fig. 1a and 
Supplementary Fig. 2c), suggesting that wild-type Salmonella induce 
cell death through two separate pathways, one controlled by NLRC4 
and the other requiring caspase-11. 

Finally, we compared the levels of cell death in macrophages derived 
from wild-type, Casp1-/- Casp11-/-, Casp11-/- or Casp1-/- 
(Casp1-/- Casp11Tg) mice infected with Salmonella. Levels of cell death 
in Casp1-/- macrophages infected with an SPI-2-deficient strain were 
similar to levels seen in wild-type macrophages (Supplementary Fig. 2d), 
indicating that Salmonella strains that cannot inject flagellin exclusively 
engage caspase-11-dependent cell death. Consistently, Casp11-/- 
macrophages infected with the ΔSPI-2 strain did not die (Supplementary 
Fig. 2d). By contrast, wild-type Salmonella induced cell 
death by canonical (NLRC4/caspase-1) and non-canonical (caspase- 
11) signalling pathways (Supplementary Figs 1 and 2d). 

NLRP3-mediated cytokine production triggered by non-canonical 
inflammasome stimuli depends on both caspase-1 and caspase-11 
(ref. 3). We therefore investigated whether the NLRP3 pathway 
induced by intracellular Salmonella required both caspases. Because 
IL-1β and IL-18 release in response to intracellular Salmonella is exclusively 
mediated by NLRC4 and NLRP3 (Supplementary Fig. 2a, b), we 
studied the response to SPI-2- or flagellin-deficient Salmonella. 
Cytokine maturation in response to these strains required both 
caspase-1 and caspase-11 (Supplementary Fig. 4). By contrast, 
cytokine maturation induced by wild-type Salmonella, which activates 
both NLRP3 and NLRC4, was only partially dependent on caspase-11, 
but absolutely required caspase-1 (Supplementary Fig. 4). These 
results indicate that intracellular Salmonella activate a non-canonical 
inflammasome similar to lipopolysaccharide/cholera toxin B and the 
enteric bacteria Citrobacter rodentium, Vibrio cholerae and Escherichia 
coli. Formation of ASC foci (specks), a measure of NLRP3-ASC complex 
formation, required caspase-11 but not caspase-1 (Supplementary 
Fig. 5a, b), indicating that caspase-11 acts upstream of the NLRP3- 
ASC complex. 

Figure 1 | Signalling through TLR4 and TRIF is required for activity of the 
non-canonical inflammasome pathway. a, LDH release from unprimed bone 
marrow-derived macrophages (BMDMs) infected with the indicated S. 
typhimurium strains for 17 h. b, c, IL-1β secretion, LDH release and 
immunoblots for processed caspase-1, caspase-11, IL-1β and IL-18 released 
from unprimed BMDMs infected with flagellin-deficient (Δflag) S. 
typhimurium for 17 h. Ext, extract; SN, supernatant. d, Induction of pro- 
caspase-11 expression in unprimed BMDMs infected with flagellin-deficient S. 
typhimurium. Graphs show the mean and s.d. of quadruplicate wells and are 
representative of three (a-c) or two (d) independent experiments. 

Stimulation of resting macrophages with lipopolysaccharide or 
interferon (IFN)-γ induces pro-caspase-11 expression. To determine 
whether Salmonella-dependent activation of the non-canonical 
inflammasome was dependent on Toll-like receptor (TLR)-mediated 
recognition, we infected Tlr4-/- macrophages with flagellin-deficient 
Salmonella (which activate the caspase-11-dependent, non-canonical 
inflammasome pathway exclusively; Supplementary Figs 2 and 4). 
Caspase-11 activation, IL-1β secretion and cell death depended on 
TLR4 (Fig. 1b). Caspase-11 processing and cell death required the 
TLR4-dependent signalling adaptor TRIF, but not the TLR4-dependent 
signalling adaptor MyD88 (Fig. 1c). By contrast, IL-1β maturation was 
reduced in both Myd88-/- and Trif-/- (also known as Ticam1-/-) 
macrophages, suggesting that cytokine maturation requires both 
adaptors. Expression of pro-IL-1β required MyD88 signalling (Supplementary 
Fig. 6a, b), explaining the lack of mature IL-1β release in 
Myd88-/- macrophages (Fig. 1c). Because cytokine production and 
cell death required TRIF, we measured pro-caspase-11 expression in 
Myd88-/-, Trif-/- and Myd88-/- Trif-/- macrophages infected with 
Salmonella. Although induction of pro-caspase-11 expression was 
delayed in Myd88-/- and Trif-/- macrophages, the levels of 
pro-caspase-11 protein in Myd88-/- Trif-/- macrophages were significant 
(Fig. 1d and Supplementary Fig. 6a, b). Thus, pro-caspase-11 
protein expression is partially dependent on TLR signalling. However, 
other pathways probably contribute. Intriguingly, non-canonical 
inflammasomes were not activated in Trif-/- macrophages (Fig. 1c) 
even though significant amounts of pro-caspase-11 were present. 
These results indicate that caspase-11 activity requires a TRIF- 
dependent signal. 

The adaptor protein TRIF induces nuclear factor-κB (NF-κB) 
activation and signals through interferon regulatory factor 3 (IRF3) 
to induce the expression of type-I IFNs. To investigate whether type-I 
IFNs could be the TRIF-dependent signal required for caspase-11 
activation, we compared the levels of IL-1β release, cell death and 
pro-caspase-11 expression in wild-type, Casp1-/- Casp11-/- and 
Irf3-/- macrophages (Fig. 2a, b and Supplementary Fig. 6c). Irf3-/- 
macrophages were significantly impaired in their ability to initiate 
caspase-11-dependent IL-1β release and cell death even though significant 
levels of pro-caspase-11 were present, albeit at slightly reduced 
levels when compared with wild-type macrophages. To confirm the 
requirement of type-I IFN signalling for caspase-11 activation, we 
measured non-canonical inflammasome activation in macrophages 
lacking components of the type-I and type-II interferon signalling 
cascade. Ifnar1-/- or STAT-1-/- (also known as Socs1-/-) macrophages 
infected with Salmonella did not process caspase-11 or induce 
non-canonical cell death (Fig. 2c), and this was not due to a lack of 
pro-caspase-11 expression (Fig. 2d). Macrophages lacking Ifngr were 
indistinguishable from wild-type macrophages, thereby excluding an 
involvement of IFN-γ. To confirm a dependency on type-I IFN signalling 
for caspase-11 activation, macrophages infected with flagellin- 
deficient Salmonella were treated with recombinant mouse IFN-β 
(Fig. 2e). Consistent with an important role for type-I IFN in 
caspase-11 activation, exogenous IFN-β restored cell death and 
caspase-11 processing in infected Myd88-/- Trif-/- but not 
Casp1-/- Casp11-/- macrophages (Fig. 2e and Supplementary Fig. 7). 
Importantly, uninfected macrophages treated with IFN-β did not 
induce LDH release, confirming that IFN-β alone cannot induce 
non-canonical cell death in the absence of an infection. Our results 
reveal a previously unreported requirement for type-I IFN signalling 
in caspase-11 activation that is consistent with a model in which an 
IFN-inducible activator mediates caspase-11 activation in response to 
intracellular Salmonella (Supplementary Fig. 1). 

Finally, to extend our findings to an in vivo setting, we infected 
wild-type, Casp1-/- Casp11-/-, Casp1-/- (Casp1-/- Casp11Tg) and 
Casp11-/- mice orally with wild-type Salmonella. As reported previously, 
Casp1-/- Casp11-/- mice had significantly higher tissue 
bacterial loads than the wild-type mice (Fig. 3a). Surprisingly, the 
levels of bacteria in Casp1-/- mice were significantly higher than in 
Casp1-/- Casp11-/- mice (Fig. 3a). Interestingly, bacterial loads in 
Casp11-/- mice were comparable to wild-type mice in all organs 
examined. Unexpectedly, these results indicate that activation of 
pro-inflammatory caspase-11 is detrimental to the host in the absence 
of caspase-1. Consistent with these findings, the bacterial loads in 
Nlrp3-/- Nlrc4-/- animals, which activate caspase-11, but not 
caspase-1 in response to Salmonella (Supplementary Fig. 2), were 
significantly higher than in Casp1-/- Casp11-/- mice (Supplementary 
Fig. 8), essentially phenocopying Casp1-/- mice. 

Although previous studies have shown that Casp1-/- Casp11-/- 
mice are more susceptible to many pathogens, the exact mechanism 
underlying these findings is not well understood. Host defence 
against systemic Salmonella infection requires neutrophils, because 
Salmonella replication in the liver and spleen is exacerbated in 
neutropenic mice. In addition, Salmonella becomes vulnerable to 
neutrophil-mediated clearance when it transits between host cells. 
We therefore investigated whether caspase-1 or caspase-11 deficiency 
resulted in reduced neutrophil influx into the spleen. Splenic 
neutrophil counts (Ly6G+ cells) were reduced in mice lacking either 
caspase-1 or caspase-11 in comparison with wild-type mice, with 
Casp1-/- Casp11-/- mice having the most significant reduction of 
neutrophil counts compared with wild-type animals (Fig. 3b). Because 
the levels of neutrophils in the mice deficient for caspase-1 or caspase-11 
were very similar to one another, and did not correlate with the bacterial 
loads, we concluded that differences in neutrophil levels alone could 
not account for the higher bacterial levels in Casp1-/- animals 
(Fig. 3a). We next examined whether caspase-1 deficiency resulted in 
any functional defects in splenic neutrophils. Neutrophils have been 
previously implicated in phagocytosing and killing extracellular 
Salmonella released by pyroptotic macrophages. Interestingly, the 
percentage of neutrophils (Ly6G+) among all Salmonella-associated 
cells (Salmo+) was significantly reduced in Casp1-/- and 
Casp1-/- Casp11-/- mice compared with wild-type and Casp11-/- 
mice (Fig. 3b). These data suggested that the lack of caspase-1 resulted 
in reduced bacterial uptake by neutrophils or reduced association with 
Salmonella during infection. However, because this reduction was 
observed in Casp1-/- and in Casp1-/- Casp11-/- mice, it did not 
explain the significantly higher number of colony-forming units 
(c.f.u.) in Casp1-/- mice (Fig. 3a). 

Figure 2 | Type-I IFN signalling is required for caspase-11 activation, but 
not for pro-caspase-11 expression. a, c, IL-1β secretion, LDH release and 
immunoblots for processed caspase-1 (CASP1), caspase-11 (CASP11) and IL-1β 
released from unprimed BMDMs infected with flagellin-deficient S. 
typhimurium for 17 h. b, d, Time course measuring pro-caspase-11 and pro-IL-1β 
induction in unprimed BMDMs infected with flagellin-deficient S. 
typhimurium. e, Cell death in BMDMs treated with recombinant mouse IFN-β 
or vehicle control and infected with flagellin-deficient S. typhimurium for 17 h. 
Graphs show the mean and s.d. of quadruplicate wells and are representative of 
three (a, c, e) or two (b, d) independent experiments. 

Figure 3 | Casp1-/- mice are more susceptible to Salmonella infection than 
Casp1-/- Casp11-/- mice. a, Bacterial burden in mice infected orally with 
green fluorescent protein (GFP)+ wild-type S. typhimurium at day 4. The 
mean (horizontal lines) and detection limit (dashed lines) are shown. b, Flow- 
cytometry analysis of neutrophil counts (Ly6G+), Salmonella-associated 
neutrophils (Salmo+), and the intracellular distribution of S. typhimurium 
(GFP+) in macrophages (F4/80) and neutrophils (Ly6G+) in the spleen. UI, 
uninfected controls. In the box plots, lines denote the median count and the 
twenty-fifth and seventy-fifth percentiles; whiskers denote maximum and 
minimum values. In the bar chart, the mean and s.e.m. are shown. c, Livers from 
S. typhimurium-infected mice showing typhoid nodules, mats of extracellular 
bacteria (stars) and single Salmonella (arrowheads). Original magnification, 
×10 or ×40 (inset). H&E, haematoxylin and eosin. Graphs depict 6-10 mice 
per genotype and are representative of two to three independent experiments. 
*P < 0.05; **P < 0.01. 

NLRC4/caspase-1-induced lysis releases intracellular Salmonella, 
thus making them accessible to neutrophil-mediated uptake and killing. 
Because both caspase-1 and caspase-11 can induce lysis of 
infected cells, we speculated that the lack of caspase-11 in 
Casp1-/- Casp11-/- mice might delay the egress of Salmonella from 
infected macrophages. Consistent with this model, we found that 
Salmonella were present to a higher degree in macrophages in 
Casp1-/- Casp11-/- mice than in wild-type, Casp11-/- and 
Casp1-/- mice (Fig. 3b). Gram stain and specific anti-Salmonella 
antibody staining of tissues showed that wild-type and Casp11-/- liver 
sections contained low levels of Salmonella, consistent with the c.f.u. 
data (Supplementary Fig. 9). Casp1-/- Casp11-/- liver sections contained 
high levels of bacteria within cells in the sinusoids 
(Supplementary Fig. 9), which is consistent with our fluorescence- 
activated cell sorting (FACS) data indicating that a larger proportion 
of Salmonella are associated with macrophages in these mice (Fig. 3b). 
Casp1-/- liver sections contained mats of extracellular bacteria within 
typhoid nodules and expanded sinusoids (Fig. 3c and Supplementary 
Figs 9 and 10a). Furthermore, mice infected with Salmonella were 
injected with the membrane-impermeable antibiotic gentamicin to dis- 
tinguish intracellular bacteria from extracellular bacteria. Gentamicin 
treatment significantly reduced bacterial counts in Casp1-/- mice, 
consistent with our histological finding that Salmonella is largely extra- 
cellular in these mice (Supplementary Fig. 10b). 

We conclude that caspase-1 deficiency results in reduced 
neutrophil-mediated clearance of Salmonella released from infected 
macrophages, thus supporting extracellular growth of this facultative 
intracellular pathogen. In keeping with this observation, it has 
been reported that Salmonella rapidly replicates extracellularly in 
the liver of neutropenic mice. This phenotype is alleviated in 
Casp1-/- Casp11-/- animals, because bacterial egress from the 
infected macrophages is delayed. Thus, caspase-11-mediated cell death 
results in detrimental effects to the host in the absence of caspase-1. 
Our results indicate that the ability of neutrophils to phagocytose 
bacteria is dependent on a caspase-1-mediated function. Because 
Casp1-/- Casp11-/-, Il1r1-/- or Il1b-/- Il18-/- mice all have 
similar Salmonella levels, the mechanism is probably not dependent 
on IL-1β and IL-18 maturation. Future studies are required to identify 
and characterize this function. Here we provide evidence that caspase- 
11-dependent cell death is exploited by Salmonella in the absence of 
caspase-1 to cause disease in the host, highlighting the need to deter- 
mine whether caspase-11 activation has similar detrimental effects for 
the host in other infectious disease models. 


METHODS SUMMARY 

Mice. Casp1-/- Casp11-/- (caspase-1 knockout), Nlrp3-/- Nlrc4-/-, Casp11-/- 
and Casp1-/- (Casp1-/- Casp11Tg) mice were backcrossed to C57BL/6 for at least 
ten generations. All mouse studies were approved by the institutional animal 
care and use committees of Genentech and Stanford University. 

Animal infections. Mice (fasted for 12 h) were inoculated orogastrically with 
2.4 X 107-1 X 10° c.f.u. of wild-type or GFP+ wild-type S. typhimurium SL1344. 
Tissues were collected at day 4 after infection, homogenized, and dilutions were 
plated on Luria-Bertani (LB) agar containing 100 µg ml-1 streptomycin. Bacterial 
numbers are expressed as c.f.u. per gram of tissue. 

Cell culture and infections. BMDMs were differentiated as described previously. 
S. typhimurium was grown to stationary phase overnight in LB broth at 
37 °C with aeration. Cells were infected at a multiplicity of infection (m.o.i.) of 
100:1 and centrifuged for 15 min at 500g to ensure comparable adhesion of the 
bacteria to the cells. Gentamicin (100 µg ml-1) was added 60 min after infection. 
Cells were washed 120 min after infection, followed by the addition of gentamicin 
(10 µg ml-1) for the remainder of the infection. Recombinant mouse IFN-β was 
added 2 h after infection as indicated. 

Full Methods and any associated references are available in the online version of 
the paper. 


METHODS 

Bacterial strains. Bacterial strains include wild-type S. typhimurium SL1344, 
GFP+ wild-type S. typhimurium SL1344 (smo22) and the following S. typhimurium 
mutants: ΔSPI-2 (ssaV::Kan) and Δflag (fljAB::Kan, fliC::Cm). 

Mice. Casp1-/- Casp11-/- (caspase-1 knockout), Nlrp3-/- Nlrc4-/-, Casp11-/- 
and Casp1-/- (Casp1-/- Casp11Tg) mice were backcrossed to C57BL/6 for at least 
ten generations. All mouse studies were approved by the institutional animal 
care and use committees of Genentech and Stanford University. 

Animal infections. Mice (fasted for 12 h) were inoculated orogastrically with 
2.4 X 107-1 X 10° c.f.u. of wild-type or GFP+ wild-type S. typhimurium SL1344. 
Tissues were collected at day 4 after infection, homogenized, and dilutions were 
plated on Luria-Bertani (LB) agar containing 100 µg ml-1 streptomycin. Bacterial 
numbers are expressed as c.f.u. per gram of tissue. For in vivo gentamicin protection 
experiments, mice were infected as above and injected intraperitoneally with 
1 mg of gentamicin in 0.2 ml sterile PBS or PBS alone at 48 h, 24 h and 12 h before 
being euthanized. Bacterial counts were analysed as above. 
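
The conversion from plate counts to c.f.u. per gram of tissue can be sketched as below. This is a minimal Python illustration of the usual back-calculation, not the authors' procedure: the function name, the homogenate volume, the plated volume, the dilution factor and the tissue mass are all hypothetical values chosen for the example, because the text does not state them.

```python
def cfu_per_gram(colonies, dilution_factor, plated_volume_ml,
                 homogenate_volume_ml, tissue_mass_g):
    """Back-calculate c.f.u. per gram of tissue from one countable plate.

    dilution_factor is the fold dilution of the plated sample relative to
    the undiluted homogenate (for example 1000 for a 10^-3 dilution).
    """
    if colonies < 0:
        raise ValueError("colony count cannot be negative")
    for name, value in (("dilution_factor", dilution_factor),
                        ("plated_volume_ml", plated_volume_ml),
                        ("homogenate_volume_ml", homogenate_volume_ml),
                        ("tissue_mass_g", tissue_mass_g)):
        if value <= 0:
            raise ValueError(f"{name} must be positive")
    cfu_per_ml = colonies * dilution_factor / plated_volume_ml  # undiluted homogenate
    total_cfu = cfu_per_ml * homogenate_volume_ml               # whole tissue sample
    return total_cfu / tissue_mass_g


# Hypothetical example: 42 colonies from 0.1 ml of a 10^-3 dilution,
# tissue of 0.25 g homogenized in 1 ml.
load = cfu_per_gram(colonies=42, dilution_factor=1_000, plated_volume_ml=0.1,
                    homogenate_volume_ml=1.0, tissue_mass_g=0.25)
print(f"{load:.2e} c.f.u. per gram")
```

Normalizing to tissue mass is what allows organs of different size to be compared on the same axis.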

Histological analysis. Livers were collected from infected mice at day 4 after infection 
and immediately placed in buffered formalin (R&D). Paraffin embedding, H&E 
staining and Gram staining were done by Histo-Tec Lab. For immunofluorescence, 
paraffin was removed from paraffin-embedded tissue sections with xylene and 
graded ethanol baths. The tissue was stained with rabbit anti-Salmonella antibodies 
and phalloidin. Tissue was imaged with a Zeiss LSM700 confocal microscope. 
FACS analysis. Spleens from infected animals were aseptically removed and 
crushed between two glass slides in RPMI containing 10% (v/v) heat-inactivated 
FBS, 25 mM HEPES, 2 mM L-glutamine, 1 mM sodium pyruvate and 55 µM 
2-mercaptoethanol. Single-cell suspensions of spleens were obtained by passage 
through 70-µm filters. Red blood cells were lysed in 175 mM ammonium chloride, 
10 mM phosphate buffer, pH 7.0. Two million cells were stained per sample. Dead 
cells were stained using a live/dead fixable dead cell stain kit (Invitrogen). Cells 
were washed in FACS buffer and rat anti-mouse CD16/CD32 (BD Biosciences) 
was added to block FcγIII/IIR before staining with analytical antibodies. Cells were 
then stained for 30 min on ice with anti-Ly6-G (clone 1A8, BioLegend), anti-F4/80 
(eBioscience) and anti-CD11b (BD Biosciences) antibodies. Stained cells were 
washed twice before flow cytometric analysis. Data were collected on an LSR II 
(BD Biosciences) at the Stanford University shared FACS facility, and the data 
were analysed with FlowJo software (Treestar). 

Cell culture and infections. BMDMs were differentiated in DMEM (Invitrogen) 
with 10% (v/v) FCS (Thermo Fisher Scientific), 10% macrophage colony- 
stimulating factor (L929 cell supernatant), 10 mM HEPES (Invitrogen) and non- 
essential amino acids (Invitrogen). BMDMs were seeded into 6-, 24- or 96-well 
plates at a density of 1.25 X 10°, 2.5 X 10°, or 5 X 10* per well. For all infections S. 
typhimurium was grown to stationary phase overnight in LB at 37°C with 
aeration and the BMDMs were infected at an m.o.i. of 100:1. The plates were 
centrifuged for 15 min at 500g to ensure comparable adhesion of the bacteria to 
the cells. Gentamicin (100 µg ml-1; Sigma-Aldrich) was added 60 min after infection 
to kill extracellular bacteria in the cultures. At 120 min after infection, the cells 
were washed once with DMEM and given fresh macrophage medium containing 
10 µg ml-1 gentamicin for the remainder of the infection. Recombinant mouse 
IFN-β (Sigma) was added 2 h after infection when necessary. 
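
The inoculum needed to reach an m.o.i. of 100:1 follows directly from the number of cells seeded per well. The Python sketch below shows that arithmetic only; the function name, the example cell number and the density of the overnight culture are hypothetical and would have to be measured in practice.

```python
def inoculum_for_well(cells_per_well, moi, culture_cfu_per_ml):
    """Return (bacteria needed, culture volume in ml) to infect one well at a given m.o.i."""
    if cells_per_well <= 0 or moi <= 0 or culture_cfu_per_ml <= 0:
        raise ValueError("all inputs must be positive")
    bacteria_needed = cells_per_well * moi           # m.o.i. = bacteria per host cell
    volume_ml = bacteria_needed / culture_cfu_per_ml
    return bacteria_needed, volume_ml


# Hypothetical example: 50,000 macrophages per well, m.o.i. 100:1,
# stationary-phase culture assumed to contain 2e9 c.f.u. per ml.
bacteria, volume_ml = inoculum_for_well(50_000, 100, 2e9)
print(f"{bacteria:.1e} bacteria, {volume_ml * 1000:.2f} µl of culture")
```
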
Immunofluorescence. BMDMs were seeded onto glass coverslips and infected as 
described above. Coverslips were fixed with 4% paraformaldehyde and stained 
with rat anti-ASC (Genentech), rabbit anti-mouse caspase-1 p10 (Santa Cruz 
Biotech) and 4’,6-diamidino-2-phenylindole (DAPI). Cells were imaged with a 
Zeiss LSM700 confocal microscope. 

Cytokine and LDH release measurement. IL-1β was measured by ELISA (R&D 
Systems). LDH was measured using CytoTox 96 (non-radioactive cytotoxicity 
assay, Promega). To normalize for spontaneous lysis, the percentage of LDH 
release was calculated as follows: ((LDH infected - LDH uninfected)/(LDH total 
lysis - LDH uninfected)) × 100. 
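
The normalization above can be written as a short Python function. This is a sketch under stated assumptions: the function name and the absorbance readings are hypothetical, and the quadruplicate summary simply mirrors the "mean and s.d. of quadruplicate wells" reported in the figure legends.

```python
import statistics


def percent_ldh_release(infected, uninfected, total_lysis):
    """Percentage LDH release, corrected for spontaneous lysis in uninfected cells."""
    denominator = total_lysis - uninfected
    if denominator <= 0:
        raise ValueError("total-lysis signal must exceed the uninfected signal")
    return (infected - uninfected) / denominator * 100.0


# Hypothetical absorbance readings for four replicate infected wells.
infected_wells = [0.85, 0.80, 0.90, 0.83]
uninfected = 0.25    # spontaneous release control
total_lysis = 1.45   # fully lysed control

releases = [percent_ldh_release(w, uninfected, total_lysis) for w in infected_wells]
print(f"mean = {statistics.mean(releases):.1f}%, s.d. = {statistics.stdev(releases):.1f}%")
```

Subtracting the uninfected signal from both numerator and denominator places every well on a scale where 0% is spontaneous lysis and 100% is complete lysis.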

Western blotting. The caspase-1 p10 subunit, caspase-11 p30 and processed 
IL-1β released into the culture supernatant were determined by western blotting. 
Macrophages were washed with plain pre-warmed DMEM lacking serum and 
phenol red 6 h after infection. The cells were then cultured in this DMEM lacking 
serum and phenol red until 17 h after infection. The supernatant was collected and 
precipitated with 10% trichloroacetic acid (v/v) for 1 h on ice. Precipitated proteins 
were pelleted at 20,000g for 30 min at 4 °C, washed with ice-cold acetone, air-dried, 
resuspended in SDS-PAGE sample buffer and heated to 95 °C for 10 min. Protein 
from 2.5 X 10° macrophages was loaded per well of a 14% acrylamide gel. Western 
blots were performed with rat anti-mouse caspase-1 antibody (4B4; Genentech) 
diluted 1:1,000, rat anti-mouse caspase-11 (17D9; Sigma) at 1:500, rabbit anti-IL- 
18 (Biovision) at 1:500 and goat anti-mouse IL-1β antibody (AF-401-NA; R&D 
Systems) diluted 1:500. Cell lysates were probed with anti-β-actin antibody 
(Sigma) at 1:2,000. 

Statistical analysis. Statistical data analysis was done using Prism 5.0a (GraphPad 
Software). Statistical significance was determined by the Mann-Whitney U-test or 
Student’s t-test. 

CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature11424 


Corrigendum: Past extreme 
warming events linked to massive 
carbon release from thawing 
permafrost 


Robert M. DeConto, Simone Galeotti, Mark Pagani, 
David Tracy, Kevin Schaefer, Tingjun Zhang, David Pollard 
& David J. Beerling 


Nature 484, 87-91 (2012); doi:10.1038/nature10929 


In the online-only Methods of our Letter, several erroneous values 
appear among the parameters used to calculate peatland D, the 
average thickness of permafrost peatland deposits. Catotelm decay 
rate (t) used in our equation 3 was incorrectly listed as 1,500 years 
instead of 15,000 years. p,, the carbon flux from the acrotelm into the 
catotelm (kg m-2 yr-1), appeared as 0.27 ± 0.19 kg m-2 yr-1 instead 
of 0.027 ± 0.019 kg m-2 yr-1. Using the correct values has no effect 
on the calculation of peatland D, because the decimal point errors 
cancel in equation 3. In addition, peat carbon density (p) listed in 
Supplementary Table 1 was incorrect and shown as peat density 
instead of peat carbon density. Using the correct value for p in equa- 
tion 3 doubles the estimated average thickness of permafrost peat 
deposits D, but does not change the calculated carbon inventories 
discussed in the text or in Table 2. These errors do not affect the 
conclusions of the Letter. All values of model parameters used in 
equation 3 have been corrected in the HTML and PDF versions 
online. The Supplementary Information has been corrected online. 
We thank C. J. Williams for pointing out the decimal point error 
associated with p. and S. Frolking for noting the cancelling decimal 
point errors associated with p, and t, and the erroneous value of peat 
carbon density (p) and its effect on peatland D. 


CORRIGENDUM 
doi:10.1038/nature11515 


Corrigendum: Mitochondrial DNA 
that escapes from autophagy causes 
inflammation and heart failure 


Takafumi Oka, Shungo Hikoso, Osamu Yamaguchi, 
Manabu Taneike, Toshihiro Takeda, Takahito Tamai, 
Jota Oyabu, Tomokazu Murakawa, Hiroyuki Nakayama, 
Kazuhiko Nishida, Shizuo Akira, Akitsugu Yamamoto, 
Issei Komuro & Kinya Otsu 


Nature 485, 251-255 (2012); doi:10.1038/nature10992 


In this Letter, several images were mistakenly switched or duplicated 
during preparation of the artwork. In Figs 1f and 2a, the sham- 
operated Dnase2a-/- and TAC-operated Dnase2a+/+ mice panels 
were switched. In Fig. 4d, the panel showing CD3 staining for 
ODN2088 control-treated TAC-operated Dnase2a+/+ mice (now 
shown correctly as black-bordered panel in Fig. 1 below) is a duplicate 
of that showing Ly6G staining for ODN2088-treated TAC-operated 
Dnase2a-/- mice. The panel showing CD45 staining for ODN2088- 
treated TAC-operated Dnase2a+/+ (now shown correctly as blue- 
bordered panel in Fig. 1 below) was prepared from the original picture 
of ODN2088 control-treated TAC-operated Dnase2a+/+. In Supplementary 
Fig. 4c, sham-operated Dnase2a-/- and TAC-operated 
Dnase2a+/+ mice panels were switched. Finally, in Supplementary 
Fig. 10d, the panels showing CD3 and Ly6G staining for sham- 
operated Tlr9+/+ mice were switched. These corrections do not alter 
any of the conclusions of this Letter, and the authors apologize for any 
confusion these errors may have caused. 

Figure 1 | This is the corrected Fig. 4d of the original Letter. 


ERRATUM 
doi:10.1038/nature11453 


Erratum: Integrated genomic 
analyses of ovarian carcinoma 


The Cancer Genome Atlas Research Network 


Nature 474, 609-615 (2011); doi:10.1038/nature10166 


In this Letter, J. G. Vockley was incorrectly listed as J. B. Vockley. Also, 
several authors were inadvertently omitted from the genome sequencing 
centre group: C. J. Creighton, from the Human Genome Sequencing 
Center at the Baylor College of Medicine; R. Nichol and S. Fisher, from 
the Sequencing Platform at the Broad Institute; and E. Upfal, from the 
Department of Computer Science at Brown University. In addition, 
J. Samayoa, from the Institute for Computational Medicine, was 
inadvertently missing from the genome data analysis centre group at 
the Johns Hopkins University. These have been corrected in the HTML 
and PDF versions of the manuscript. 

MENTAL HEALTH 


Under a cloud 


Depression is rife among graduate students and postdocs. 
Universities are working to get them the help they need. 


BY VIRGINIA GEWIN 


Lauren was always a top student, but the
pressures of her first year studying for a
PhD in atmospheric chemistry at a UK
university sent her spiralling into depression. 
At best, she couldn't focus on academic tasks, 
feeling as if her brain was “scrambled”; at 
worst, she couldn't get out of bed. 


She developed a crippling fear of present- 
ing her research. “Doing a PhD is such a 
personal thing, one that you've invested so 
much time in, that any criticism can feel like a 
direct reflection of yourself,” says Lauren. 

But she did something that many postgradu- 
ates do not: she got help. With counselling and 
medication, Lauren — a pseudonym that she 
uses on a blog detailing her experience (see
go.nature.com/4ta9fo) — is entering the final 
year of her PhD. Hers is one of more than 50 sto- 
ries highlighted on the website Students Against 
Depression, funded by the Charlie Waller 
Memorial Trust in Thatcham, UK. “The website 
aims to raise awareness that depression isn't a 
personal failing or weakness; it's a serious condition
that requires treatment,” says psychologist
Denise Meyer, the website's project manager. 

For early-career scientists, competing aca- 
demic demands simmer in a stew of isolation, 
high expectations and sleeplessness that can 
boil over into debilitating depression, agoniz- 
ing bouts of anxiety or even suicide attempts. 
Even if students feel that they can handle the 
isolation and stress of a graduate programme, 
extra stresses, such as problems in a relationship 
with an adviser or a partner, can tip them over 
the edge. It is important to understand the signs 
of depression and anxiety, know what resources 
are available on and off campus, and have an 
idea of what to expect from counselling. 


RECOGNIZING THE PROBLEM 

Statistics specific to graduate students are hard 
to come by, but surveys suggest that rates of
depression have doubled among all US col- 
lege students over the past 15 years, and inci- 
dence of suicidal behaviour has tripled. The 
best estimates are that about 10% of US college 
students, and 4% of all UK university students,
seek treatment. And the vast majority of
mental-health disorders, from schizophrenia to 
bipolar disorder, manifest during the teens and 
twenties: the college and postgraduate years. 

It can be difficult for graduate students to 
recognize that they need help. Stress is normal 
— especially during training. But changes in 
personality or behaviour can signal problems 
in need of attention (see ‘Signs to heed’). 

“Depression can have cognitive impacts that 
interfere with concentration and learning — 
things that go to the heart of being a student 
— making it more difficult to recognize this 
as depression,” says Gordon Strauss, director of
student mental-health services at the Univer- 
sity of Louisville in Kentucky. “Students trying 
to assess whether they are depressed should 
ask themselves if they are able to perform 
research, be productive and derive satisfaction 
from their research.”

STRESS AND RESILIENCE
The links between adversity and
mental illness. nature.com/stress

These changes are especially tough for
students who, like Lauren, have previously
been high achievers. “I’ve seen graduate stu- 
dents, who have been academically successful 
all their lives, get to graduate school and experi- 
ence academic failure for the first time — which 
can come as quite a shock and fuel feelings of 
anxiety,” says Karen Peterson, director of the 
office of scientific career development at the
Fred Hutchinson Cancer Research Center in
Seattle, Washington.

“If there is a long wait for treatment, people
tend not to follow through.” Ronald Albucher

Perhaps not surprisingly, one of the main
concerns for graduate students is whether they
come to believe that they have chosen to
specialize in the wrong field. “We see lots of
students who have this concern yet can't
imagine switching fields because they are so
far along,” says
Ronald Albucher, director of counselling and 
psychological services at Stanford University 
in California. “These are legitimate concerns 
that we recommend talking through with 
friends, family or counsellors.”


SEEKING HELP 
Getting help can prove challenging. Stigma 
remains an issue, although more in some cul- 
tures than in others. “In Italy, very few gradu- 
ate students ask for psychological help, because 
of the judgement that goes along with it,” says 
Paolo Valerio, coordinator of the Naples Uni- 
versity Centre of Psychological Consultation, 
part of a campus populated by more than 
90,000 students. But there are other barri- 
ers, too. “Students have competing priorities, 
from academic deadlines to social activities, 
that make it easy to put off caring for yourself 
— whether consciously or unconsciously,” says 
Daniel Eisenberg, a health-policy researcher at 
the University of Michigan in Ann Arbor. 

Students will be better prepared if they know 
what services are available. “When a student 
moves to a new place, it’s a prudent idea to fig- 
ure out how to get all kinds of health care — 
including mental-health support,” says Victor 
Schwartz, medical director of the Jed Founda- 
tion, a non-profit group in New York that aims 
to promote emotional health and prevent sui- 
cide among students. Graduate students often 
have access to undergraduate services; postdocs 
may be able to use staff resources. However, 
the most recent postdoc survey by Sigma Xi,
a scientific-research society based in Research 
Triangle Park, North Carolina, suggests that 
almost 40% of postdocs don’t know whether 
counselling is available at their university. 

“In times of crisis, it’s hard to think straight,”
says Julie Gold, a professional-development
coach with the Office of Intramural Training 
and Education at the US National Institutes 
of Health (NIH) in Bethesda, Maryland. “I 
encourage incoming trainees to call the coun- 
selling centre to find out the hours and how to 
get an appointment.” Gold also suggests that 
programme directors discuss mental-health 
resources with students when they start and 
throughout the semester. 

Most US and UK universities have some 
mental-health support on campus, but it is 
usually aimed at undergraduates. In main- 
land Europe, services are spottier: one-third 
of higher-education institutions don’t offer 
counselling. And the quality of services can 
differ widely. “If you’ve seen one campus 
mental-health programme, you've only seen 
one campus mental-health programme; it’s 
extraordinarily variable,” says Jerald Kay, chair
of psychiatry at Wright State University School 
of Medicine in Dayton, Ohio. 


COST MATTERS 
In the United States, where treatment can be 
expensive, a student's decision to seek mental- 
health help “boils down to an issue of funding 
priorities”, says Kay. In 2006, the University of
Louisville imposed a surcharge on tuition fees 
to expand the student health service and make 
psychiatric help available to more than 20,000 
students. “The psychiatric services I provide 
are covered by the health fee so neither insur- 
ance nor parents are billed,” says Strauss. 

Similarly, Roger Chalkley, an associate dean 
for education at Vanderbilt University in Nash- 
ville, Tennessee, has created a free, confidential 
service for his university’s 1,100 biomedical 
postgraduate students. 

Seeking help earlier rather than later is 
a good approach, say counsellors — par- 
ticularly because many medications can take 
weeks or months to become effective. Stan- 
ford University has increased services for its 
8,500 full-time graduate students and 7,000
undergraduates to ensure that people are seen
more quickly. “If there is a long wait, people
tend to not follow through,” says Albucher.


SYMPTOMS
Signs to heed

There are many signs of depression
and anxiety, the two most common
mental-health disorders among
postgraduates. Anyone who experiences
one or more of the following symptoms
should consider seeking help:

• Inability to attend class or do research
• Difficulty concentrating
• Decreased motivation
• Increased irritability
• Sleep disturbances: insomnia, or
sleep is no longer restorative
• Changes in appetite or energy levels
• Increased social withdrawal V.G.

One-third of US university mental-health 
centres place limits on the number of counselling
sessions allowed. And as demand increases
among UK students, university counselling ser- 
vices have also started offering a set number of 
sessions, says Patti Wallace, lead adviser for uni- 
versity and college counselling with the British 
Association for Counselling and Psychotherapy, 
based in Lutterworth. Once the limit is reached, 
counsellors will usually refer students who still 
need treatment to other local services. 

When campus centres are overbooked or 
inadequate, outside services can prove useful
(see ‘Where to get help’). This year, the US
National Graduate Student Crisis Line (800- 
472-3457) has received 1,140 calls: triple the 
total in 2011. The rise has been fuelled in part 
by a mention in The Chronicle of Higher Education
in August. “The hotline exists because of
the concern that graduate students may not
want to seek help locally for fear of appearing 
weak,” says Nick Repak, executive director of 
Grad Resources in Plano, Texas, which con- 
tracts and trains the Boys Town National Hot- 
line to run the service. 


COUNSELLING EXPECTATIONS 

Students may be unclear on what counsel- 
ling sessions will entail. Some are surprised to 
learn that many issues can be resolved quickly. 
“Many of the students I see come for only one 
session to talk through difficulties and receive 
suggestions on how better to manage stress,” 
says Wallace. “We aim to get people functioning
effectively as quickly as possible.” Wallace
focuses on offering pragmatic strategies to deal 
with depression, anxiety and relationship prob- 
lems, but she says that each situation is unique, 
and it is often difficult to work out to what 
extent troubles are attributable to academic 
work or personal life. 

A greater concern might be confidentiality, 
which is a top priority at counselling centres. 
“Everything students tell me is private and 
confidential — even if they were referred to 
me by an adviser or member of the faculty,” 
says Strauss. Counsellors will break confidence 
only if they feel that a patient presents immi- 
nent harm to themselves or others. 

“It’s important for students to understand 
that the counsellors offering treatment are not 
the agents of the university — they are there at 
the behest of the student's welfare,” says Kay. 

However, Albucher points out that there can 
be a loophole: if students are still using a parent
or guardian’s health insurance, an explanation 
of services charged to the student might be sent 
home by the insurance company, inadvertently 
breaking the news about the treatment. 

Postgraduates often grapple with whether to 
disclose mental-health issues to their advisers. 
“You have to be realistic that some supervi- 
sors are not necessarily the person to discuss 
your worries with,” says Sharon Milgram,
director of the NIH Office of Intramural
Training and Education. Wallace suggests
that postgraduates talk to a university-based
counsellor about what can be gained by confiding,
and what level of detail is appropriate.
“If a postgraduate feels some level of disclosure
would help their adviser understand
their recent behaviour, it may suffice to simply
say, ‘I’ve been having problems and I’m
getting help for it’,” says Wallace.


RESOURCES
Where to get help

In the United States:

• Grad Resources
www.gradresources.org
• The Jed Foundation
www.jedfoundation.org
• ULifeline
www.ulifeline.org
• Campus Mind Works
www.campusmindworks.org

In the United Kingdom:

• Mental Wealth
www.mentalwealthuk.com
• Student Counselling
www.student.counselling.co.uk
• Students Against Depression
www.studentsagainstdepression.org

Data from the United States and the 
United Kingdom suggest that counselling 
helps the vast majority of students to get 
past personal problems and excel. To be 
successful, people must take care of them- 
selves, notes Milgram. “Getting help,” she 
says, “is a sign of strength, not weakness.”


Virginia Gewin is a freelance writer based 
in Portland, Oregon. 
CORRECTION 

The story ‘A voice for the voiceless’ 
(Nature 489, 461-463; 2012) incorrectly 
stated that the US National Postdoctoral 
Association (NPA) is working with
the Association of Public and Land-grant
Universities to develop a certification 
programme. The NPA is developing the 
programme on its own but will seek 
support and input from the university 
association. 


COLUMN 


Advising on the edge 


Interdisciplinary mentorship must evolve to keep pace 
with innovative programmes, argues Katherine Mackey. 


More than ever, scientists are working
across disciplines. Addressing and
analysing climate change, sustain- 
able development and genomic data requires 
unusual and interdisciplinary approaches. As 
problems and solutions co-evolve in unex- 
pected ways, skill sets must evolve too — as 
must mentorship. Mentors should work to 
bring early-career scholars up to speed on 
issues such as funding structures and com- 
munication styles tailored to interdisciplinary 
efforts — and protégés should be proactive. 

Interdisciplinary work often takes root in an 
established department. Scholars must func- 
tion in the existing framework, even when the 
goals, timelines and deliverables of the research 
differ. So when the research grows into its own 
programme, the first scholars to seek tenure 
in it will have no mentors who have navigated 
the same process. As early-career researchers 
attempt to break new ground, they must seek 
the few interdisciplinary mentors who can help 
them to find their bearings. 

Young scholars should cast a wide net. Men- 
tors from other disciplines may provide fresh 
perspectives, and their enthusiasm for collabo- 
rating can build an early-career researcher’s 
confidence. But interdisciplinary scholars still 
need strong mentors in their own programmes, 
who are sensitive to extra-departmental obliga- 
tions and can help them to keep pace with dead- 
lines and degree or tenure requirements. 

To ensure that their protégés get used to the 
culture of multiple disciplines, mentors should 
help students to identify and forge collabora- 
tions with other experts. This might include 
arranging training in different laboratories 
or universities, to build networks from which 
interdisciplinary progress can grow. 


SPECIAL CHALLENGES 

Logistics set interdisciplinary mentorship 
apart. For students, the need for course work 
and training in both a disciplinary department 
and an interdisciplinary programme can dou- 
ble early-stage time commitments and delay 
dissertation projects. It is essential for advisers 
and committee members to communicate with 
the student and each other to prioritize train- 
ing objectives. 

Early-career researchers on the tenure track 
face more serious challenges: owing to lack of 
precedent, institutional variability and multi- 
disciplinary tenure committees, expectations 


may not be well established. How many 
first-authored papers should be written per 
year? By when should the first grant be funded? 
How many courses should be developed and/ 
or taught each year? All of these milestones can 
vary between disciplines. 

Committee or department chairs should 
issue annual progress reports to ensure that a 
scientist's research goals mesh with the department’s
needs. This gives the scholar a chance to
adapt their research early on, to include angles 
or activities that the department values. 

The challenge isn't just about understand- 
ing benchmarks, but also about finding ways 
to receive credit for unusual, difficult-to- 
evaluate work such as science-policy state- 
ments, decision-support systems and outreach 
tools. And it may not be clear whether or how 
extra-departmental courses, multi-authored 
projects or shared graduate students (who gets 
credit for mentoring them?) will count towards 
tenure requirements. 

Mentors can clarify how these achievements 
will be ‘scored’ during the tenure process. And 
they should strive to help protégés to under- 
stand the opinions and expectations of com- 
mittee members by, for example, facilitating 
discussions about how progress is charted. 

As interdisciplinary scholars look for inno- 
vative solutions, their mentors must also 
develop innovative approaches. This will help 
the protégés to enrich their education and earn 
useful degrees — and chart a course towards a 
successful career.


Katherine Mackey is a US National Science 
Foundation biology fellow at the Woods
Hole Oceanographic Institution and the 
Marine Biological Laboratory in Woods Hole, 
Massachusetts. 